Plagiarism: Where Do We Draw the Line?

Bitcoin Forum

Author

Topic: Plagiarism: Where Do We Draw the Line? (Read 948 times)

PrimeNumber7

Copper Member
Legendary

Offline

Activity: 1652
Merit: 1901

Amazon Prime Member #7

Re: Plagiarism: Where Do We Draw the Line?

September 09, 2021, 01:25:40 PM

#61

Quote from: amishmanish on September 08, 2021, 05:18:20 AM

Quote from: PrimeNumber7 on September 07, 2021, 05:56:48 AM

The problem is that it is really not possible to check every new post for plagiarism because the cost of checking an additional post will grow for every additional post written. For example, if there are 100 posts that exist on the forum, the cost of checking a new post against all existing posts is 100 units. Once there are 1000 posts on the forum, the cost of checking a single new post against all existing posts is 1000 units. For each additional post made, it costs one additional unit to check a single additional post. This is obviously not sustainable.

Thanks for chiming in. Discussing these things is always interesting. You are talking about the time complexity of such a search and match algorithm.

Right. As the number of posts increases, so does the amount of time it takes to check one additional post.
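To make the cost argument concrete, here is a minimal sketch of the naive approach, where a new post is compared against every existing post one at a time. The function name `naive_check` and the sample posts are made up for illustration, and exact string equality stands in for whatever comparison would really be used.

```python
def naive_check(new_post, existing_posts):
    """Compare new_post against every existing post, one by one.

    Returns (is_duplicate, comparisons_made).
    """
    comparisons = 0
    for old_post in existing_posts:
        comparisons += 1
        if old_post == new_post:
            return True, comparisons
    return False, comparisons


# The cost of checking one original post equals the number of existing posts.
for n in (100, 1000):
    posts = [f"post number {i}" for i in range(n)]
    found, cost = naive_check("a brand new post", posts)
    print(n, found, cost)
```

With this approach an original post is always the worst case, since every existing post has to be looked at before the check can say "no match".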

Quote from: amishmanish on September 08, 2021, 05:18:20 AM

You'd first need a set of master data with all possible 6 word snippets of text from all the existing posts. (provided someone is copying only from existing Bitcoin posts). This would then have to be compared with the set of snippets formed from every new post. While this could be done, I believe the space and memory requirements would be pretty huge.

You are describing one way in which all current posts could be checked for plagiarism (at least plagiarism by copying other users' posts).

What you describe is missing two things. Existing posts would not be checked for plagiarism, and if a post is written in the future and is subsequently plagiarized, the setup you describe would not catch it.
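For reference, the 6-word snippet idea can be sketched roughly as below. This is only an illustration under my own assumptions, not anything the forum runs: `SnippetIndex`, `snippets` and `check_and_add` are hypothetical names, and the sketch assumes a variant where every checked post is also added to the index, so that later copies of that post would match as well.

```python
from collections import defaultdict

SNIPPET_WORDS = 6  # snippet length taken from the quoted proposal


def snippets(text, k=SNIPPET_WORDS):
    """Return the set of all k-word snippets in text (lowercased)."""
    words = text.lower().split()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


class SnippetIndex:
    def __init__(self):
        # snippet -> ids of the posts that contain it
        self.index = defaultdict(set)

    def check_and_add(self, post_id, text):
        """Return ids of earlier posts sharing a snippet, then index this post."""
        shingles = snippets(text)
        matches = set()
        for s in shingles:
            matches |= self.index.get(s, set())
        for s in shingles:
            self.index[s].add(post_id)
        return matches


idx = SnippetIndex()
idx.check_and_add(1, "the quick brown fox jumps over the lazy dog every single morning")
print(idx.check_and_add(2, "honestly the quick brown fox jumps over the lazy dog I think"))
```

The lookup cost here depends on the length of the new post rather than on the number of existing posts, at the price of storing every snippet. It only catches word-for-word copying of posts already in the index.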

suchmoon

Legendary

Offline

Activity: 3836
Merit: 9064

https://bpip.org

Re: Plagiarism: Where Do We Draw the Line?

September 09, 2021, 01:53:24 PM

#62

Quote from: amishmanish on September 08, 2021, 05:18:20 AM

You'd first need a set of master data with all possible 6 word snippets of text from all the existing posts. (provided someone is copying only from existing Bitcoin posts). This would then have to be compared with the set of snippets formed from every new post. While this could be done, I believe the space and memory requirements would be pretty huge. Though, doesn't Google do it for, like, all of the internet? And Altavista used to do it at one time. Now, Google has humongous capacity, of course, but I don't think the old sites like Altavista had that.

There are clever indexing methods that make this kind of search relatively quick and also can match slight variations, even word spinning to an extent.
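No specific method is named here, so the following is just one common approach and not necessarily what is meant: split each post into normalized word triples and score the overlap with Jaccard similarity. The names `word_shingles` and `jaccard` and the choice of 3-word shingles are my own assumptions for the sketch.

```python
import re


def word_shingles(text, k=3):
    """Lowercase, strip punctuation, and return the set of k-word shingles."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Shared shingles divided by all shingles; 0.0 if either text is too short."""
    sa, sb = word_shingles(a), word_shingles(b)
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


original = "Bitcoin is a peer to peer electronic cash system"
spun = "Bitcoin is a peer to peer digital cash system"
print(jaccard(original, spun))
```

A post with a few swapped words still shares most of its shingles with the source, so it scores well above an unrelated post. A post translated from another language shares none, which is the limitation raised in the next paragraph.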

I'm not really sure what you're proposing (checking your posts against all other posts? why exactly?) but the plagiarism problem in general is not a technical one. We can use all sorts of tricks to catch plagiarists and they'll just make their posts more and more obscure - copying from outside sources, translating from other languages, etc - as long as there is a financial incentive to do so.

nutildah

Legendary

Offline

Activity: 3164
Merit: 8544

Happy 10th Birthday to Dogeparty!

Re: Plagiarism: Where Do We Draw the Line?

September 11, 2021, 12:48:06 AM

#63

Quote from: suchmoon on September 09, 2021, 01:53:24 PM

We can use all sorts of tricks to catch plagiarists and they'll just make their posts more and more obscure - copying from outside sources, translating from other languages, etc - as long as there is a financial incentive to do so.

The lengths some people will go to in order to avoid having an original thought are pretty amazing. I mean, how hard is it to just think of a sentence in your head and transcribe it into a post?

The other day a WO semi-regular was banned for copying sports articles written in Italian and Google Translating them. He was a Legendary member too. Possibly the worst part is he's not even Italian.
