Bitcoin Forum
May 03, 2024, 03:02:59 PM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: « 1 2 3 [4]  All
  Print  
Author Topic: Plagiarism: Where Do We Draw the Line?  (Read 876 times)
PrimeNumber7
Copper Member
Legendary
*
Offline Offline

Activity: 1624
Merit: 1899

Amazon Prime Member #7


View Profile
September 09, 2021, 01:25:40 PM
 #61

The problem is that it is really not possible to check every new post for plagiarism because the cost of checking an additional post will grow for every additional post written. For example, if there are 100 posts that exist on the forum, the cost of checking a new post against all existing posts is 100 units. Once there are 1000 posts on the forum, the cost of checking a single new post against all existing posts is 1000 units. For each additional post made, it costs one additional unit to check a single additional post. This is obviously not sustainable.
Thanks for chiming in. Discussing these things is always interesting. You are talking about the time complexity of such a search and match algorithm.
Right. As the number of posts increase, so does the amount of time it takes to check one additional post.

You'd first need a set of master data with all possible 6 word snippets of text from all the existing posts. (provided someone is copying only from existing Bitcoin posts). This would then have to be compared with the set of snippets formed from every new post. While this could be done, I believe the space and memory requirements would be pretty huge.
You are describing one way in which all current posts could be checked for plagiarism (at least plagiarism by copying other users' posts).

What you describe is missing two things. Existing posts would not be checked for plagiarism, and if a post is written in the future and is subsequently plagiarized, the setup you describe would not catch it.
1714748579
Hero Member
*
Offline Offline

Posts: 1714748579

View Profile Personal Message (Offline)

Ignore
1714748579
Reply with quote  #2

1714748579
Report to moderator
1714748579
Hero Member
*
Offline Offline

Posts: 1714748579

View Profile Personal Message (Offline)

Ignore
1714748579
Reply with quote  #2

1714748579
Report to moderator
I HATE TABLES I HATE TABLES I HA(╯°□°)╯︵ ┻━┻ TABLES I HATE TABLES I HATE TABLES
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
1714748579
Hero Member
*
Offline Offline

Posts: 1714748579

View Profile Personal Message (Offline)

Ignore
1714748579
Reply with quote  #2

1714748579
Report to moderator
suchmoon
Legendary
*
Offline Offline

Activity: 3654
Merit: 8922


https://bpip.org


View Profile WWW
September 09, 2021, 01:53:24 PM
 #62

You'd first need a set of master data with all possible 6 word snippets of text from all the existing posts. (provided someone is copying only from existing Bitcoin posts). This would then have to be compared with the set of snippets formed from every new post. While this could be done, I believe the space and memory requirements would be pretty huge. Though, doesn't google do it for like, all of the internet? And Altavista used to do it at one time. Now, google has humungous capacity of course but I don't think that the old sites like Altavista had those.

There are clever indexing methods that make this kind of search relatively quick and also can match slight variations, even word spinning to an extent.

I'm not really sure what you're proposing (checking your posts against all other posts? why exactly?) but the plagiarism problem in general is not a technical one. We can use all sorts of tricks to catch plagiarists and they'll just make their posts more and more obscure - copying from outside sources, translating from other languages, etc - as long as there is a financial incentive to do so.
nutildah
Legendary
*
Offline Offline

Activity: 2982
Merit: 7968



View Profile WWW
September 11, 2021, 12:48:06 AM
 #63

We can use all sorts of tricks to catch plagiarists and they'll just make their posts more and more obscure - copying from outside sources, translating from other languages, etc - as long as there is a financial incentive to do so.

The lengths some people will go to in order to avoid having an original thought are pretty amazing. I mean, how hard is it to just think of a sentence in your head and transcribe it into a post?

The other day a WO semi-regular was banned for copying sports articles written in Italian and Google Translating them. He was a legendary too. Possibly the worst part is he's not even Italian.

▄▄███████▄▄
▄██████████████▄
▄██████████████████▄
▄████▀▀▀▀███▀▀▀▀█████▄
▄█████████████▄█▀████▄
███████████▄███████████
██████████▄█▀███████████
██████████▀████████████
▀█████▄█▀█████████████▀
▀████▄▄▄▄███▄▄▄▄████▀
▀██████████████████▀
▀███████████████▀
▀▀███████▀▀
.
 MΞTAWIN  THE FIRST WEB3 CASINO   
.
.. PLAY NOW ..
Pages: « 1 2 3 [4]  All
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!