I'd just have to write a side-script to prevent users from just wrapping their messages in ["quote"] tags. Quoted text isn't counted for payment for signature spammers *, so they're unlikely to hide their plagiarism that way. * Assuming the campaign has a campaign manager that does at least some of his job.
|
|
|
At least they have something to lose now, their 0.002BTC won't come back when they get banned.
|
|
|
But I see the signature campaigns do not accept copper members. Since this hasn't been answered yet: it depends on the campaign manager. I would accept Copper Members at Member rate, but I would also require them to have earned 35 Merit, to ensure their post quality is high. I truly feel sorry for you if $12 can buy you food for a week Why feel sorry? I wish I could buy a week's worth of food for $12! a lot of the members here are very rude to poor 3rd worlders, I'm sure others here will berate you for not earning a single merit. Only a few members are very rude. Most real users hate spam though, no matter where you come from.
|
|
|
Tinker a little with the number of words and the threshold for detection of duplicates, and you're probably almost there for a large share of the copy-pasta spam. I'm more worried about the very high number of positive results. Let me play around a bit with yesterday's data, from post 45850092 up to post 45893434. My scraper caught 43184 out of 43343 posts (it misses some burst posts). This is after the new Merit requirements, so there's less spam already. I'll show the 50 most used posts (raw HTML excluding quotes; the number at the start of each line shows how often they appear). Those posts are exactly the same each time they were posted: 288 (post was empty or only a quote) 162 Do you have a telegram channel? 91 Proof of Authentication:<br />Joined Telegram Campaign 45 Bump 25 bump 24 microguy talks to himself just like he trades himself just like he lol himself <img src="https://bitcointalk.org/Smileys/default/grin.gif" alt="Grin" border="0" /> <img src="https://bitcointalk.org/Smileys/default/grin.gif" alt="Grin" border="0" /> <img src="https://bitcointalk.org/Smileys/default/grin.gif" alt="Grin" border="0" /><br /><br />sounds like the shytcoin is showing its age like microguy is <span style="font-size: 99pt !important; line-height: 1.3em;">🤔</span><br /><br />sounds like a igotspots shytcoin scam checkpoint dysfunction still better than btc right<br /><br /><span style="color: blue;"><span style="font-size: 90pt !important; line-height: 1.3em;">Whats in your wallet</span></span><br /><br /><span style="font-size: 90pt !important; line-height: 1.3em;"><span style="color: brown;"><a class="ul" href="https://imgur.com/rPLBZVM">https://imgur.com/rPLBZVM</a></span></span> 23 <div align="center"><b><br />For a more general context on our seed round, and the reasons for this funding round please read our <a class="ul" href="https://[Suspicious link removed">/i3ufCd]medium article</a></b> 20 <div align="center"><span style="font-size: 20pt !important; line-height: 1.3em;"><b>Hello Everyone, GOeureka are live now with Bounty Campaign.<br /> Please follow given link to participate</b></span> 19 hi<br />i noticed you deleted you telegram account recently<br />why?<br />i am still waiting the letter and when it arrives how can i contact you?<br />please contact me at @AmbrogioOrfeu on telegram 18 IMPORTANT ANNOUNCEMENTS ABOUT INBOT FUTURE :<br /><br />1. Our revenue for first 6 months was more than whole 2017!<br />2. We are hiring Partner Managers and Business Operations Managers.<br />3. We are moving InToken from Ethereum to Stellar blockchain.<br />4. We will list InToken without an ICO. 17 hello everyone <br />here im talking about a new cryptocurrency which is THUNDERSTAKE (TSC) .TSC PoS staking rewards: 900% APR fixed, every block number dividable by 10 is a superblock with double APR (1800 %) .<br />we have made products with TSC logo which you can buy from our website with TSC coin as payment.TSC is live on CMC and 5 exchanges, Cyptobridge,mercatox,Stokes.exchange, bitrex and escodex .<br />here is our website link <a class="ul" href="https://thunderstake.com">https://thunderstake.com</a> and discord link : <a class="ul" href="https://discord.gg/wmu9Zcx">https://discord.gg/wmu9Zcx</a> you can get everything from here have a look 16 Up 16 Proof of Authentication:<br />Joined Telegram Campaign<br /> 15 up 14 week #1<br />Reddit Campaign<br />Reddit name: <br />Reddit user Url: <br />Like any post on Subreddit (list with links to post):<br />1.<br /> 14 #proof:<br />Twitter username:@cryptonerdd<br />Telegram username:@cryptonerdd<br />ERC20 address:0x51494b94939D2C8353d069206887687C40eD92B9<br /> 13 microguy talks to himself just like he trades himself just like he lol himself <img src="https://bitcointalk.org/Smileys/default/grin.gif" alt="Grin" border="0" /> <img src="https://bitcointalk.org/Smileys/default/grin.gif" alt="Grin" border="0" /> <img src="https://bitcointalk.org/Smileys/default/grin.gif" alt="Grin" border="0" /><br /><br />sounds like the shytcoin is showing its age like microguy is <span style="font-size: 99pt !important; line-height: 1.3em;">🤔</span><br /><br />sounds like a igotspots shytcoin scam checkpoint dysfunction still better than btc right<br /><br /><span style="color: blue;"><span style="font-size: 70pt !important; line-height: 1.3em;">Whats in your wallet</span></span><br /><br /><span style="font-size: 90pt !important; line-height: 1.3em;"><span style="color: brown;"><a class="ul" href="https://imgur.com/rPLBZVM">https://imgur.com/rPLBZVM</a></span></span> 12 Bitcointalk username: aloha0001<br />Forum rank: member<br />Posts count: 255<br />ETH address: 0x04ddhA7Bb8b08af5E6866C1efc3rehe54a2859E6<br /> 12 <div align="center"><b><span style="font-size: 15pt !important; line-height: 1.3em;"><span style="color: orange;">Update</span></span></b> 11 reserved 11 Twitter<br /><br />Retweets<br />1.<a class="ul" href="https://mobile.twitter.com/MaestroProject1/status/1003536243370545152">https://mobile.twitter.com/MaestroProject1/status/1003536243370545152</a><br />2.<a class="ul" href="https://mobile.twitter.com/MaestroProject1/status/1003824945430843393">https://mobile.twitter.com/MaestroProject1/status/1003824945430843393</a><br />3.<a class="ul" href="https://mobile.twitter.com/MaestroProject1/status/1004547290063765508">https://mobile.twitter.com/MaestroProject1/status/1004547290063765508</a><br />4.<br />5.<br /><br />Tweets<br />1.<a class="ul" href="https://mobile.twitter.com/amanda_septiasa/status/1003681341915869184">https://mobile.twitter.com/amanda_septiasa/status/1003681341915869184</a><br />2.<br /> 10 Week #1<br />Facebook<br /><br />Shares + Likes<br /><br />1. <a class="ul" href="https://www.facebook.com/amro.trikid/posts/10212205125466225">https://www.facebook.com/amro.trikid/posts/10212205125466225</a><br />2. <a class="ul" href="https://www.facebook.com/amro.trikid/posts/10212210015988485">https://www.facebook.com/amro.trikid/posts/10212210015988485</a><br />3. <a class="ul" href="https://www.facebook.com/amro.trikid/posts/10212219066614745">https://www.facebook.com/amro.trikid/posts/10212219066614745</a><br />4. <a class="ul" href="https://www.facebook.com/amro.trikid/posts/10212228785097701">https://www.facebook.com/amro.trikid/posts/10212228785097701</a><br />5. <a class="ul" href="https://www.facebook.com/amro.trikid/posts/10212228787657765">https://www.facebook.com/amro.trikid/posts/10212228787657765</a><br /> 10 WEEK#1<br />Facebook Campaign<br />Facebook Link: <a class="ul" href="https://facebook.com/deerey.area">https://facebook.com/deerey.area</a><br />Friends: 1100<br /><br />Post:<br /><br />Shared:<br /> 10 Twitter Campaign <br />Twitter user Url: <a class="ul" href="https://twitter.com/4LUtr1qGRLB">https://twitter.com/4LUtr1qGRLB</a> <br />Repost and Like any post on Twitter (list with links): <br /><a class="ul" href="https://twitter.com/bitflipcc/status/10101578403">https://twitter.com/bitflipcc/status/10101578403</a><br /> 10 Bitcointalk account URL : <br />TELEGRAM username: @zlo2323<br />language: Korean<br />Rank: Jr.Member<br />Eth address: 0xaE0304fd2b399c790170aA6Ea6A1d6E78713f96<br /> 10 <br />test 10 <br /> 10 #PROOF OF AUTHENTICATION POST<br />Joined Twitter Campaign<br />Bitcointalk Username: Dollar1980<br />Telegram Username: @TahsibGhurair<br />Twitter Username: @Tahsib_Ghurair<br />Twitter Account Url: <a class="ul" href="https://twitter.com/Tahsib_Ghurair">https://twitter.com/Tahsib_Ghurair</a><br /> 9 Native language: Russian <br />Bitcointalk username: Sabergas1w7 <br />Profile link: <a class="ul" href="https://bitcointalk.org/index.php?action=profile;u=161465763">https://bitcointalk.org/index.php?action=profile;u=161465763</a> <br />Part of the bounty you apply for: ANN <br />Experience: NO <br />Telegram: <a class="ul" href="https://t.me/Sadbis1g7">https://t.me/Sadbis1g7</a> <br />Email: <a href="mailto:gaerhe5ra@mail.ru">gaerhe5ra@mail.ru</a> <br />Ethereum address: 0x91D8f2e4hjdEC122568f4c2cd5D14a362glk561F <br />Please PM me if you accept. <br /> 9 #Proof of Authentication<br /><br />Campaign : Telegram & Twitter <br />Bitcointalk Username: notnotok<br />Telegram Username : @khalidalbudoor<br />Twitter Account Link: <a class="ul" href="https://twitter.com/khalidalbudoor7">https://twitter.com/khalidalbudoor7</a><br />Twitter Username: @khalidalbudoor7<br /> 9 #PROOF OF AUTHENTICATION POST<br />Joined Twitter Campaign<br />Bitcointalk Username: ExcellentOffer86<br />Twitter Account Url: <a class="ul" href="https://twitter.com/Saeed_Imtiaz1">https://twitter.com/Saeed_Imtiaz1</a><br />Telegram Username: @Saeed_Imtiaz1<br /> 8 Week #1<br />Twitter<br /><br />Retweets<br />1. <a class="ul" href="https://twitter.com/MaestroProject1/status/998832211800412160">https://twitter.com/MaestroProject1/status/998832211800412160</a><br />2. <a class="ul" href="https://twitter.com/MaestroProject1/status/998839809895350272">https://twitter.com/MaestroProject1/status/998839809895350272</a><br />3. <a class="ul" href="https://twitter.com/MaestroProject1/status/999005931881906176">https://twitter.com/MaestroProject1/status/999005931881906176</a><br />4. <a class="ul" href="https://twitter.com/MaestroProject1/status/999036079238868992">https://twitter.com/MaestroProject1/status/999036079238868992</a><br />5. <a class="ul" href="https://twitter.com/MaestroProject1/status/999043596345950208">https://twitter.com/MaestroProject1/status/999043596345950208</a><br /><br />Tweets<br />1. <a class="ul" href="https://twitter.com/hellofancydei/status/1004721044937105413">https://twitter.com/hellofancydei/status/1004721044937105413</a><br />2. <a class="ul" href="https://twitter.com/hellofancydei/status/1004721411192049667">https://twitter.com/hellofancydei/status/1004721411192049667</a> 8 Facebook<br />Week #1<br /><br />Twitter Profile Link: <a class="ul" href="https://twitter.com/CREoday_ru">https://twitter.com/CREoday_ru</a><br />Like and Retweet:<br />1. <a class="ul" href="https://twitter.com/medXe1/status/961630808724459520">https://twitter.com/medXe1/status/961630808724459520</a><br />2. <a class="ul" href="https://twitter.com/medXe1/status/962393102601412608">https://twitter.com/medXe1/status/962393102601412608</a><br />3. <a class="ul" href="https://twitter.com/medXe1/status/962767627113455616">https://twitter.com/medXe1/status/962767627113455616</a><br />4. <a class="ul" href="https://twitter.com/medXe1/status/962768328770146309">https://twitter.com/medXe1/status/962768328770146309</a><br />5. <a class="ul" href="https://twitter.com/medXe1/status/975583417281712128">https://twitter.com/medXe1/status/975583417281712128</a><br /><br />Facebook Profile Link: <a class="ul" href="https://www.facebook.com/ar.amur.ru">https://www.facebook.com/ar.amur.ru</a><br />Like and Share:<br />1. <a class="ul" href="https://www.facebook.com/ar.amur.ru/posts/597475630588609">https://www.facebook.com/ar.amur.ru/posts/597475630588609</a><br />2. <a class="ul" href="https://www.facebook.com/ar.amur.ru/posts/597613640574808">https://www.facebook.com/ar.amur.ru/posts/597613640574808</a><br />3. <a class="ul" href="https://www.facebook.com/ar.amur.ru/posts/597994343870071">https://www.facebook.com/ar.amur.ru/posts/597994343870071</a><br />4. <a class="ul" href="https://www.facebook.com/ar.amur.ru/posts/598519390484233">https://www.facebook.com/ar.amur.ru/posts/598519390484233</a><br />5. <a class="ul" href="https://www.facebook.com/ar.amur.ru/posts/599002237102615">https://www.facebook.com/ar.amur.ru/posts/599002237102615</a><br /> 8 <a href="https://i.imgur.com/QBgno2y.png">https://i.imgur.com/QBgno2y.png</a><br /><br />We invite you to bring your project to <b><a class="ul" href="http://Altmarkets.cc">Altmarkets.cc</a></b>,<br /><br /><br />Add your coin to our exchange by requesting <b><a class="ul" href="https://docs.google.com/forms/d/e/1FAIpQLSejTGyelV8OleqYbGqscdvWrMKsXOp8bCvO4VCtkFqAAJctcg/viewform?usp=send_form">Here</a></b><br /><br /><br />(OPTIONAL) Join us on Discord to speak directly to us about your listing request : <b><a class="ul" href="https://discord.gg/ZhQzy5f">https://discord.gg/ZhQzy5f</a></b><br /><br />Our Fees - <a class="ul" href="https://altmarkets.cc/fees">https://altmarkets.cc/fees</a><br />Listing Policy: <a class="ul" href="https://altmarkets.cc/add_coin">https://altmarkets.cc/add_coin</a> 7 week 1<br /><br />Tweet link : <br />1. <br />2. <br />3. <br /><br />Retweet link : <br />1. <a class="ul" href="https://twitter.com/MaestroProject1/status/10016030348670208">https://twitter.com/MaestroProject1/status/10016030348670208</a><br />2. <br />3. <br />4. <br />5. <br /><br />LIke & share link : <br />1. <a class="ul" href="https://web.facebook.com/coinhunt1/posts/28284343478955285">https://web.facebook.com/coinhunt1/posts/28284343478955285</a><br />2. <br />3. <br />4. <br />5. <br /> 7 Proof of joined post<br />Campaign in which you participate: Linkedin campaign<br />ETH address: 0x02Aft679fd80E9dD51cac1dc5se45f42578fhj64<br /> 7 I want to reserve a signature campaign.<br />BitcoinTalk name: jordarheje89<br />BitcoinTalk profile link: <a class="ul" href="https://bitcointalk.org/index.php?action=profile;u=1866560678;sa=summary">https://bitcointalk.org/index.php?action=profile;u=1866560678;sa=summary</a><br />Eth Address: 0xCd332c24rhehBfa3A9d658D2F33Aheh2eF5689<br /> 7 Bump. 7 <div align="center"><b><span style="font-size: 15pt !important; line-height: 1.3em;"><span style="color: #7e0dbd;">RainCheck | Update</span></span></b> 7 +12000 subcribers on Telegram<br />Come and chat with the Team<br /><a class="ul" href="https://t.me/brodweyrealteam">https://t.me/brodweyrealteam</a><br /> 7 #proof:<br />Twitter username:@cryptonerdd<br />Telegram username:@cryptonerdd<br />ERC20 address:0x51494b94939D2C8353d069206887687C40eD92B9 7 #Proof of Authentication Post Link<br /><br />Twitter Campaign<br />Twitter Account : <a class="ul" href="https://twitter.com/DarinaBovsiktak">https://twitter.com/DarinaBovsiktak</a><br />Facebook Campaign<br />Facebook: <a class="ul" href="https://www.facebook.com/DorianTopz">https://www.facebook.com/DorianTopz</a> 7 #PROOF OF AUTHENTICATION POST<br />Joined Twitter Campaign<br />Bitcointalk Username: ExcellentOffer86<br />Twitter Username: @Saeed_Imtiaz1<br />Twitter Account Url: <a class="ul" href="https://twitter.com/Saeed_Imtiaz1">https://twitter.com/Saeed_Imtiaz1</a><br />Telegram Username: @Saeed_Imtiaz1<br /> 7 ##PROOF OF AUTHENTICATION##<br />Bitcointalk Username: trishaanywhite<br /><br /><br />Joined Campaigns: Twitter<br />Twitter User Name: trishaanywhite<br />Twitter Account Url : <a class="ul" href="https://twitter.com/trishaanywhite">https://twitter.com/trishaanywhite</a><br /><br /><br />Joined Campaigns: Telegram<br />Telegram user Name: @trishaany<br />Telegram Url: <a class="ul" href="https://t.me/trishaany">https://t.me/trishaany</a><br /><br /> 6 TRANSLATION IN INDONESIAN<br />Bitcointalk username: adelaisav <br />Native language: indonesia<br />Email: <a href="mailto:dancukbanget@gmail.com">dancukbanget@gmail.com</a> <br />Telegram: @filarisdianto <br />Part of bounty you apply for : ALL<br />Translation/moderation experience: <a class="ul" href="https://docs.google.com/spreadsheets/d/1Ltym_vuCnAvpGD7F7KnldJtm7wYP8S3sdZ7pdRaK8Jg/htmlview">https://docs.google.com/spreadsheets/d/1Ltym_vuCnAvpGD7F7KnldJtm7wYP8S3sdZ7pdRaK8Jg/htmlview</a><br />ETH address: 0xb02518F08daeb2Ef11a50edB152C59507D0EB2F5<br />Pm me if you need sir 6 Reserve 6 Project looks great but there are tons of projects like this and my question is, how can you be a bit defirrent than other payment system? 6 IMPORTANT ANNOUNCEMENTS ABOUT INBOT FUTURE :<br /><br />1. Our revenue for first 6 months was more than whole 2017!<br />2. We are hiring Partner Managers and Business Operations Managers.<br />3. We are moving InToken from Ethereum to Stellar blockchain.<br />4. We will list InToken without an ICO.<br /> 6 Hi dev,<br />I'm writing to you with an offer of listing at one of the major masternodes monitoring website - <a class="ul" href="http://masternodes.plus">http://masternodes.plus</a> (MasterNodesPlus).<br />You have been selected and approved for listing as recommended masternode coin.<br />To be listed at the website, you can use one of the three offers:<br /><br />Normal listing-up to 24 hours: 0.1BTC<br />Listing an ICO (coin not available on any exchange) up to 6 hours: 0,3BTC<br /><br />You can make your request for the lisitng here:<br /><a class="ul" href="https://masternodes.plus/contact.html">https://masternodes.plus/contact.html</a><br /><br /><br />Regards,<br />Timothy James-Quill<br />\93MNP\94<br /> 6 A request to prospective clients, please post a message on the forum thread first to keep the thread alive and then make a contact using above mentioned contacts for prompt response.<br /><br />--------------------------------------------------------------<br />For users in China/Hong, they can also contact via QQ.<br /><br />QQ: 256447418 The first line is my own description. It's mainly caused by bounty spammers: they quote their own old post, then edit it to add their latest bounty report spam. My scraper catches the posts before they're edited. This doesn't really catch plagiarism, but it catches spam. When you're looking for word phrases to detect plagiarism, you're likely to get even more hits than this. The second entry came from Cidonar, who bumped this thread 162 times. That board shouldn't allow deleting posts within 24 hours, but it does. The user isn't banned, as he deleted the evidence. The third entry ("Proof of Authentication") came from many different users in this thread. I've just reported a few asking to check the thread. The sixth entry ("microguy talks to himself") came from BitCoin ranger, who had 24 posts deleted by moderators. Manually going through this list is a lot of work, while there aren't many posts to report. It's not very effective to do.
|
|
|
And if you have any other Idea to how to get merits .. There's this crazy idea going around this forum to earn merits. It's very well hidden from shitposters, they don't have a clue it even exists! It's really a groundbreaking method, which is actively being used by thousands of users to earn Merit. I'm not sure if I should tell you though. Should I? Okay, here it goes, but I'll hide it from prying eyes, so you need to select the text to be able to read it. Here it goes: STOP SHITPOSTING AND START CONTRIBUTING!What is this spam?
|
|
|
A short reduce in spam posts is recognizable Maybe the Merit requirement can be raised again when many Jr. Members continue spamming. At some point they'll run out of sMerit. This is a serious problem:
|
|
|
Meta - one of the few safe harbors from spam has be decimated by threads about merit. I've never had to report so many posts in one of my threads in Meta before!
|
|
|
That way other people might think twice before buying or exchanging 1 merit to get out of the "newbie hell" (source needed for the expression "newbie hell"... I don't remember who came up with the term, but it's my new go-to expression) I saw it first used by MagicSmoker, but the oldest reference I can find is Omega0255 more than 7 years ago. I might make a thread as soon as friday's merit statistics become published (i don't want to make a thread for a script that can't be used right away)... I'll update mine in about 48 hours. Theymos' original data will be a couple hours earlier. Sorry to have hijacked your thread, i'll try not to do it again No problem, I started it
|
|
|
My proposition was to filter out potential abusers using my script, then manually verify the list and only punish those that received merits for posts that are complete garbage... This leaves the possibility the shitposter received Merit without buying it. The sender on the other hand is for sure guilty in this scenario. Your project deserves it's own thread!
|
|
|
Moderators bans a man who violated the rules and its forbidden to create a new account on the forum for him. I got a ban for something I didn't do and now I cant write on the forum That's one of the risks of buying an account. You won't be unbanned.
|
|
|
Could DT users review The only evidence I see, is you buying trust spam.
|
|
|
You missed my favourite one: Some people just shouldn't be on here. If they post continuous spam and haven't bothered to read the rules then they don't deserve anybodies time.
If someone came to your house with dirty shoes (ignoring the please take off your shoes sign) turned on the TV loudly, wrote on your furniture, slowed down your internet by downloading constantly, invited their friends over and ate from your fridge empty and was there purely for the "free food and drink". Would you tell them what they were doing wrong or tell them to go away in a less than polite way ?
|
|
|
I think @mocacinno will be able to scraped all the newbies who ranked up to Jr. Member from yesterday to today, but he won't(at least take lot of scraping) catch up the Jr. members who demoted yesterday but ranked up again within the same day. Scraping all accounts that received Merit isn't much work, especially if you only scrape the once that recently received their first Merit.
|
|
|
P.S: If any mods/admins aren't ok with me scraping the site, by all means let me know. I'd obviously write the bot/script in such a way that it doesn't slam the server & only send a certain amount of requests per second/minute (more or less like a Google bot). I know other users have written similar bots/scraping tools, so I thought it'd be ok. But if not, just let me know I've recently started scraping recent. My script saves the first unedited version of the post in raw HTML, excluding quotes. Your post for example looks like this: Initscri 186520 45883661 <a href="https://bitcointalk.org/index.php#4">Other</a> / <a href="https://bitcointalk.org/index.php?board=24.0">Meta</a> / <b><a href="https://bitcointalk.org/index.php?topic=5032322.msg45883661#msg45883661">"Multiple Accounts" / Copy-pasta detection scripts/bots</a></b>
Hey all,<br /><br />I've been planning to write a few scripts relating to BitcoinTalk. It's been on my "developer bucket list" to write something to detect users who have multiple accounts. In order to accomplish this, and have a reliable list, I'd have to determine some logic in order to base this.<br /><br />I have a few things in mind:<br /><br />Index/scrape posts &:<br /><br />For <b>multiple account detection</b>:<br /><br />- Look for same address usage between posts (BTC, ETH, etc)<br />- Look for same account usage between posts (telegram, skype, etc)<br />- [other ideas here]<br /><br />For <b>copy-pasta detection</b>:<br />- write a script to determine copy-pasta from accounts by matching the text of posts to similar text of other sites in order to return a probability percentage of the user copy/pasting (including src for manual analysis)<br />- [other ideas here]<br /><br />Results would be posted here for mods to look at (if need be), or just to keep a record of such a connection. I'd also probably link to results in <a class="ul" href="https://bitcointalk.org/index.php?topic=1926895.0">this topic</a><br /><br />I wanted to post this thread in advance to see if anyone else had any other logic / ideas in mind for these scripts/bots? This will solely be when I have the time to create this (which won't be for a couple of weeks), so I thought I'd post this well in advance.<br /><br />Thanks! The first line is your Username, then userID, post number, some raw headers, and the last line is the post itself. In compressed format, it takes about 10 MB per day. Instead of scraping the same data again, I could easily send it to you, and a few day's worth of data should be enough for you to start testing. If interested, let me know.
You'll be in for a surprise if you start looking for plagiarism! I sometimes sort a day's worth of posts and search for exact duplicates. This typically gives a few dozen posts that are posted a few dozen times. Most of them are spam, many of them are just spammers posting the same useless "proof of authentication" and more crap like that. Detecting the text spinners will be a whole different level!
|
|
|
Added. Can you add my suggestion in that thread to the OP?
|
|
|
Close Your Started Thread if You already got the Answer For technical questions, it's very helpful to add [SOLVED] to the topic title before locking it. And here's the best thing: if you have a problem, and you're looking for a solution, add "solved" to your search question in Google. It will help you find what you're looking for.
|
|
|
Sorry, I can only scrape what still exists. Unfortunately I don't have the space to store all posts (although I've started this a few days ago, compressed posts excluding quotes only take 10 MB per day). I don't really see much harm in this: a spammer with less spam, how great is that! There are a few more accounts that self-deranked, I think one even went from Hero to Newbie. edit: this forum didn't have the posts anymore, but there is another clone, so you may look at it too I tried the other phishing site (through archive.is, because my PC can't access it) already, but it's gone there too.
|
|
|
Let me test https://bitcointalk.org/index.php?board=666.0: The topic or board you are looking for appears to be either missing or off limits to you. What part of that message isn't clear? It either doesn't exist, or it's not meant for your eyes. There's nothing to do, except for maybe locking this thread.
|
|
|
Are you able to calculate how many jr.members were born from yesterday to today and from who were born? I can, but mocacinno is planning something similar already. For example, if a person gave the birth to 10 or 20 or 30 Jr.Members can starts to be suspicious! o_e_l_e_o is chasing those abusers.
|
|
|
|