Bitcoin Forum
November 03, 2024, 04:08:00 PM *
Author Topic: Plagiarism: Where Do We Draw the Line?  (Read 948 times)
PrimeNumber7
Copper Member
Legendary

Activity: 1652
Merit: 1901

Amazon Prime Member #7


September 09, 2021, 01:25:40 PM
 #61

The problem is that it is not really possible to check every new post for plagiarism, because the cost of checking one new post grows with every post already written. For example, if 100 posts exist on the forum, checking a new post against all of them costs 100 units. Once there are 1000 posts, checking a single new post costs 1000 units. Every post added to the forum makes each later check one unit more expensive. This is obviously not sustainable.
Thanks for chiming in. Discussing these things is always interesting. You are talking about the time complexity of such a search-and-match algorithm.
Right. As the number of posts increases, so does the amount of time it takes to check one additional post.
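
To put numbers on the cost argument above, here is a minimal sketch of the naive approach: compare the new post against every stored post, one at a time. The function names (naive_check, total_checks) and the sample posts are made up for illustration, and a comparison is simplified to an exact match after lowercasing, which is far cruder than a real plagiarism check.

```python
def naive_check(new_post, existing_posts):
    """Compare new_post against every existing post.

    Returns (indices of matching posts, number of comparisons made).
    A "comparison" here is a simple case-insensitive equality test.
    """
    comparisons = 0
    matches = []
    needle = new_post.strip().lower()
    for idx, old in enumerate(existing_posts):
        comparisons += 1
        if needle == old.strip().lower():
            matches.append(idx)
    return matches, comparisons


def total_checks(n):
    """Comparisons needed if each of n posts is checked on arrival
    against all posts that came before it: 0 + 1 + ... + (n - 1)."""
    return sum(range(n))


# Hypothetical forum contents, only to count comparisons.
posts_100 = [f"post number {i}" for i in range(100)]
posts_1000 = [f"post number {i}" for i in range(1000)]

_, cost_100 = naive_check("a brand new post", posts_100)
_, cost_1000 = naive_check("a brand new post", posts_1000)
print(cost_100, cost_1000)
print(total_checks(100), total_checks(1000))
```

The per-post cost is linear in the number of stored posts, so the cumulative cost of checking every post as it arrives grows roughly with the square of the forum size.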

You'd first need a set of master data with all possible 6-word snippets of text from all the existing posts (provided someone is copying only from existing Bitcoin posts). This would then have to be compared with the set of snippets formed from every new post. While this could be done, I believe the space and memory requirements would be pretty huge.
You are describing one way in which all current posts could be checked for plagiarism (at least plagiarism by copying other users' posts).
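
The snippet idea quoted above can be sketched roughly as follows. This is only an illustration: the helper names (shingles, build_master, copied_snippets), the tokenizing regex and the sample sentences are assumptions, not anything the forum actually runs.

```python
import re


def shingles(text, k=6):
    """Return the set of all k-word snippets (shingles) in text.

    Words are lowercased and punctuation is dropped, so trivial
    formatting changes do not hide a copy.
    """
    words = re.findall(r"[a-z0-9']+", text.lower())
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def build_master(posts, k=6):
    """Union of the k-word snippets of every existing post."""
    master = set()
    for post in posts:
        master |= shingles(post, k)
    return master


def copied_snippets(new_post, master, k=6):
    """Snippets of new_post that already appear in the master set."""
    return shingles(new_post, k) & master


# Hypothetical data for illustration.
existing = ["The quick brown fox jumps over the lazy dog near the river bank."]
master = build_master(existing)

new_post = "I think the quick brown fox jumps over the lazy dog, honestly."
found = copied_snippets(new_post, master)
print(len(found))
print(sorted(found))
```

Because the master data is a hash set, looking up one snippet does not require scanning every post. The price is memory: a post of n words contributes up to n - 5 six-word snippets.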

What you describe is missing two things. Existing posts would not be checked for plagiarism, and if a post is written in the future and is subsequently plagiarized, the setup you describe would not catch it.
suchmoon
Legendary

Activity: 3836
Merit: 9064


https://bpip.org


September 09, 2021, 01:53:24 PM
 #62

You'd first need a set of master data with all possible 6-word snippets of text from all the existing posts (provided someone is copying only from existing Bitcoin posts). This would then have to be compared with the set of snippets formed from every new post. While this could be done, I believe the space and memory requirements would be pretty huge. Though, doesn't Google do it for, like, all of the internet? And AltaVista used to do it at one time. Now, Google has humongous capacity of course, but I don't think the old sites like AltaVista had that.

There are clever indexing methods that make this kind of search relatively quick and also can match slight variations, even word spinning to an extent.
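
One common way to get this kind of behaviour is an inverted index from short word sequences to the posts that contain them, scored by the fraction of sequences shared. The sketch below is an assumption about what such indexing could look like, not a description of any particular search engine; the class name SnippetIndex, the 3-word window and the 0.5 threshold are arbitrary choices for illustration.

```python
import re
from collections import defaultdict


class SnippetIndex:
    """Inverted index: k-word shingle -> ids of posts containing it."""

    def __init__(self, k=3):
        self.k = k
        self.index = defaultdict(set)

    def _shingles(self, text):
        words = re.findall(r"[a-z0-9']+", text.lower())
        return {tuple(words[i:i + self.k])
                for i in range(len(words) - self.k + 1)}

    def add(self, post_id, text):
        """Index a post; can be called for each new post as it arrives."""
        for s in self._shingles(text):
            self.index[s].add(post_id)

    def query(self, text, threshold=0.5):
        """Return {post_id: share of the query's shingles found in that post}
        for posts at or above the threshold."""
        sh = self._shingles(text)
        if not sh:
            return {}
        hits = defaultdict(int)
        for s in sh:
            for pid in self.index.get(s, ()):
                hits[pid] += 1
        return {pid: n / len(sh) for pid, n in hits.items()
                if n / len(sh) >= threshold}


# Hypothetical posts for illustration.
idx = SnippetIndex(k=3)
idx.add(1, "Bitcoin mining difficulty adjusts every 2016 blocks "
           "to keep block time near ten minutes")
idx.add(2, "I like pizza with extra cheese and mushrooms on a thin crust")

# Same sentence as post 1 with one word swapped ("adjusts" -> "changes").
result = idx.query("Bitcoin mining difficulty changes every 2016 blocks "
                   "to keep block time near ten minutes")
print(result)
```

The work per query depends on the length of the new post and on how many posts share its shingles, not on the total number of posts. A single swapped word only breaks the few shingles that contain it, which is why light word spinning still scores high; heavy rewording or translation would not be caught this way.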

I'm not really sure what you're proposing (checking your posts against all other posts? why exactly?) but the plagiarism problem in general is not a technical one. We can use all sorts of tricks to catch plagiarists, and they'll just make their sources more and more obscure (copying from outside sources, translating from other languages, etc.) as long as there is a financial incentive to do so.
nutildah
Legendary

Activity: 3164
Merit: 8544


Happy 10th Birthday to Dogeparty!


September 11, 2021, 12:48:06 AM
 #63

We can use all sorts of tricks to catch plagiarists and they'll just make their posts more and more obscure - copying from outside sources, translating from other languages, etc - as long as there is a financial incentive to do so.

The lengths some people will go to in order to avoid having an original thought are pretty amazing. I mean, how hard is it to just think of a sentence in your head and transcribe it into a post?

The other day a WO semi-regular was banned for copying sports articles written in Italian and running them through Google Translate. He was a Legendary, too. Possibly the worst part is he's not even Italian.
