A short-term reduction in spam posts is noticeable. Maybe the Merit requirement can be raised again if many Jr. Members continue spamming. At some point they'll run out of sMerit. This is a serious problem:
|
|
|
Meta, one of the few safe harbors from spam, has been decimated by threads about merit. I've never had to report so many posts in one of my threads in Meta before!
|
|
|
That way other people might think twice before buying or exchanging 1 merit to get out of the "newbie hell" (source needed for the expression "newbie hell"... I don't remember who came up with the term, but it's my new go-to expression). I saw it first used by MagicSmoker, but the oldest reference I can find is Omega0255 more than 7 years ago. I might make a thread as soon as Friday's merit statistics are published (I don't want to make a thread for a script that can't be used right away)... I'll update mine in about 48 hours. Theymos' original data will be a couple of hours earlier. Sorry to have hijacked your thread, I'll try not to do it again ![Wink](https://bitcointalk.org/Smileys/default/wink.gif) No problem, I started it ![Tongue](https://bitcointalk.org/Smileys/default/tongue.gif)
|
|
|
My proposition was to filter out potential abusers using my script, then manually verify the list and only punish those that received merits for posts that are complete garbage... This leaves the possibility that the shitposter received Merit without buying it. The sender, on the other hand, is for sure guilty in this scenario. Your project deserves its own thread!
|
|
|
Moderators ban a man who violated the rules, and he is forbidden to create a new account on the forum. I got a ban for something I didn't do and now I can't write on the forum. That's one of the risks of buying an account. You won't be unbanned.
|
|
|
Could DT users review The only evidence I see, is you buying trust spam.
|
|
|
You missed my favourite one: Some people just shouldn't be on here. If they post continuous spam and haven't bothered to read the rules then they don't deserve anybody's time.
If someone came to your house with dirty shoes (ignoring the "please take off your shoes" sign), turned on the TV loudly, wrote on your furniture, slowed down your internet by downloading constantly, invited their friends over, emptied your fridge, and was there purely for the "free food and drink", would you tell them what they were doing wrong, or tell them to go away in a less than polite way?
|
|
|
I think @mocacinno will be able to scrape all the newbies who ranked up to Jr. Member from yesterday to today, but he won't catch (or at least it will take a lot of scraping to catch) the Jr. Members who were demoted yesterday but ranked up again within the same day. Scraping all accounts that received Merit isn't much work, especially if you only scrape the ones that recently received their first Merit.
|
|
|
P.S: If any mods/admins aren't ok with me scraping the site, by all means let me know. I'd obviously write the bot/script in such a way that it doesn't slam the server & only send a certain amount of requests per second/minute (more or less like a Google bot). I know other users have written similar bots/scraping tools, so I thought it'd be ok. But if not, just let me know ![Smiley](https://bitcointalk.org/Smileys/default/smiley.gif) I've recently started scraping recent. My script saves the first unedited version of the post in raw HTML, excluding quotes. Your post for example looks like this: Initscri 186520 45883661 <a href="https://bitcointalk.org/index.php#4">Other</a> / <a href="https://bitcointalk.org/index.php?board=24.0">Meta</a> / <b><a href="https://bitcointalk.org/index.php?topic=5032322.msg45883661#msg45883661">"Multiple Accounts" / Copy-pasta detection scripts/bots</a></b>
Hey all,<br /><br />I've been planning to write a few scripts relating to BitcoinTalk. It's been on my "developer bucket list" to write something to detect users who have multiple accounts. In order to accomplish this, and have a reliable list, I'd have to determine some logic in order to base this.<br /><br />I have a few things in mind:<br /><br />Index/scrape posts &:<br /><br />For <b>multiple account detection</b>:<br /><br />- Look for same address usage between posts (BTC, ETH, etc)<br />- Look for same account usage between posts (telegram, skype, etc)<br />- [other ideas here]<br /><br />For <b>copy-pasta detection</b>:<br />- write a script to determine copy-pasta from accounts by matching the text of posts to similar text of other sites in order to return a probability percentage of the user copy/pasting (including src for manual analysis)<br />- [other ideas here]<br /><br />Results would be posted here for mods to look at (if need be), or just to keep a record of such a connection. I'd also probably link to results in <a class="ul" href="https://bitcointalk.org/index.php?topic=1926895.0">this topic</a><br /><br />I wanted to post this thread in advance to see if anyone else had any other logic / ideas in mind for these scripts/bots? This will solely be when I have the time to create this (which won't be for a couple of weeks), so I thought I'd post this well in advance.<br /><br />Thanks!
The first line is your username, then user ID, post number, some raw headers, and the last line is the post itself. In compressed format, it takes about 10 MB per day. Instead of scraping the same data again, I could easily send it to you, and a few days' worth of data should be enough for you to start testing. If interested, let me know.
You'll be in for a surprise if you start looking for plagiarism! I sometimes sort a day's worth of posts and search for exact duplicates. This typically gives a few dozen posts that are posted a few dozen times. Most of them are spam, many of them are just spammers posting the same useless "proof of authentication" and more crap like that. Detecting the text spinners will be a whole different level!
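The exact-duplicate search described above could look roughly like the sketch below. It is only an illustration: the `(username, post_id, body)` tuple layout, the `normalize` helper and the sample posts are assumptions made for this example, not the format or code of the actual script.

```python
import re
from collections import defaultdict


def normalize(body: str) -> str:
    """Collapse runs of whitespace so trivially reformatted copies still match."""
    return re.sub(r"\s+", " ", body).strip()


def find_exact_duplicates(posts, min_copies=2):
    """Group posts by normalized body text.

    posts: iterable of (username, post_id, body) tuples (hypothetical layout).
    Returns a list of (text, [(username, post_id), ...]) pairs for every text
    that appears at least min_copies times, most frequent first.
    """
    groups = defaultdict(list)
    for username, post_id, body in posts:
        groups[normalize(body)].append((username, post_id))
    dupes = {text: who for text, who in groups.items() if len(who) >= min_copies}
    return sorted(dupes.items(), key=lambda item: len(item[1]), reverse=True)


# Made-up sample data, only to show the call.
sample_posts = [
    ("alice", 1, "proof of authentication"),
    ("bob", 2, "proof  of authentication\n"),
    ("carol", 3, "A real question about fees"),
    ("dave", 4, "proof of authentication"),
]

duplicates = find_exact_duplicates(sample_posts)
for text, copies in duplicates:
    print(len(copies), repr(text), copies)
```

This only catches literal copies (after whitespace cleanup). Spun text would need some form of fuzzy matching, which this sketch does not attempt.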
|
|
|
Added. Can you add my suggestion in that thread to the OP?
|
|
|
Close Your Started Thread if You already got the Answer For technical questions, it's very helpful to add [SOLVED] to the topic title before locking it. And here's the best thing: if you have a problem, and you're looking for a solution, add "solved" to your search question in Google. It will help you find what you're looking for.
|
|
|
Sorry, I can only scrape what still exists. Unfortunately I don't have the space to store all posts (although I started doing this a few days ago; compressed posts excluding quotes only take 10 MB per day). I don't really see much harm in this: a spammer with less spam, how great is that! There are a few more accounts that self-deranked; I think one even went from Hero to Newbie. edit: this forum didn't have the posts anymore, but there is another clone, so you may look at it too. I tried the other phishing site (through archive.is, because my PC can't access it) already, but it's gone there too.
|
|
|
Let me test https://bitcointalk.org/index.php?board=666.0: The topic or board you are looking for appears to be either missing or off limits to you. What part of that message isn't clear? It either doesn't exist, or it's not meant for your eyes. There's nothing to do, except for maybe locking this thread.
|
|
|
Are you able to calculate how many Jr. Members were born from yesterday to today, and from whom? I can, but mocacinno is planning something similar already. For example, if a person gave birth to 10 or 20 or 30 Jr. Members, that starts to be suspicious! o_e_l_e_o is chasing those abusers.
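As a rough sketch of how such a count could be made: take a list of Merit transactions and, for each sender, count how many users received their first-ever Merit from them. The `(timestamp, amount, sender, receiver)` layout, the usernames and the threshold of 3 are all made up for this example, and "first Merit" is only a proxy for ranking up to Jr. Member.

```python
from collections import Counter


def first_merit_senders(transactions):
    """Count, per sender, how many users got their first-ever Merit from them.

    transactions: iterable of (timestamp, amount, sender, receiver) tuples
    (hypothetical layout, not the format of any real data dump).
    """
    first_sender = {}
    for timestamp, amount, sender, receiver in sorted(transactions, key=lambda t: t[0]):
        # setdefault keeps only the earliest sender for each receiver.
        first_sender.setdefault(receiver, sender)
    return Counter(first_sender.values())


# Made-up sample data, only to show the call.
sample_transactions = [
    (100, 1, "seller", "newbie1"),
    (110, 1, "seller", "newbie2"),
    (120, 2, "helper", "newbie1"),  # not newbie1's first Merit, so not counted
    (130, 1, "seller", "newbie3"),
    (140, 1, "helper", "newbie4"),
]

counts = first_merit_senders(sample_transactions)
suspicious = [sender for sender, n in counts.items() if n >= 3]
print(counts.most_common())
print("Worth a manual look:", suspicious)
```

A high count is only a reason to look at the merited posts by hand; it doesn't prove anything on its own.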
|
|
|
This way spammers can't cheat the activity threshold by deleting posts So they'll just cheat the post count by deleting posts.
|
|
|
Totally revised version (original quoted at the end of this post)! This makes adding new data less work ![Smiley](https://bitcointalk.org/Smileys/default/smiley.gif)
Theymos announced the "Enhanced newbie restrictions & requirements" in post 45810047. This was on Sep 17 06:27 (Dutch time). Post 44746501 was made 3 weeks (plus 60 minutes) earlier. In 3 weeks (+60 minutes) before the announcement, 1063546 posts were made. On average, in 3 weeks before the announcement, the forum saw 353813 new posts per week. This is the number I'll use to compare results with.
Post 46096455 was made 1 week later.
Post 46376958 was made 2 weeks later.
Post 46636652 was made 3 weeks later.
Post 46900337 was made 4 weeks later.
Post 47140676 was made 5 weeks later.
Post 47380721 was made 6 weeks later.
Post 47600410 was made 7 weeks later.
Post 47818184 was made 8 weeks later.
Post 48033319 was made 9 weeks later.
Post 48232805 was made 10 weeks later.
Post 48410472 was made 11 weeks later.
Post 48569803 was made 12 weeks later.
Post 48714784 was made 13 weeks later.
Post 48854475 was made 14 weeks later.
Post 48978781 was made 15 weeks later.
That means:
- In the first week after the announcement, 286408 posts were made (-19.05%).
- In the second week after the announcement, 280503 posts were made (-20.72%).
- In the third week after the announcement, 259694 posts were made (-26.60%).
- In the fourth week after the announcement, 263685 posts were made (-25.47%).
- In the fifth week after the announcement, 240339 posts were made (-32.07%).
- In the sixth week after the announcement, 240045 posts were made (-32.15%).
- In the seventh week after the announcement, 219689 posts were made (-37.91%).
- In the eighth week after the announcement, 217774 posts were made (-38.45%).
- In the ninth week after the announcement, 215135 posts were made (-39.20%).
- In the tenth week after the announcement, 199486 posts were made (-43.62%).
- In the eleventh week after the announcement, 177677 posts were made (-49.78%).
- In the twelfth week after the announcement, 159331 posts were made (-54.97%).
- In the thirteenth week after the announcement, 144981 posts were made (-59.02%).
- In the fourteenth week after the announcement, 139691 posts were made (-60.52%).
- In the fifteenth week after the announcement, 124306 posts were made (-64.87%).
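The weekly numbers above can be reproduced from the post numbers alone, since post numbers are sequential. The sketch below makes one assumption that isn't stated in the post: the 353813 baseline seems to be the 1063546 posts scaled down from 505 hours (3 weeks plus 60 minutes) to 504 hours and then divided by 3, which gives that number exactly. All variable names are made up for this example.

```python
ANNOUNCEMENT = 45810047
THREE_WEEKS_BEFORE = 44746501  # made 3 weeks plus 60 minutes before the announcement

# Post numbers made exactly 1, 2, ..., 15 weeks after the announcement.
WEEKLY_POST_IDS = [
    46096455, 46376958, 46636652, 46900337, 47140676,
    47380721, 47600410, 47818184, 48033319, 48232805,
    48410472, 48569803, 48714784, 48854475, 48978781,
]

HOURS_IN_3_WEEKS = 3 * 7 * 24  # 504

# Assumption: correct the 3-week count for the extra 60 minutes, then average per week.
posts_before = ANNOUNCEMENT - THREE_WEEKS_BEFORE
baseline = posts_before * HOURS_IN_3_WEEKS / (HOURS_IN_3_WEEKS + 1) / 3

# Posts per week are the differences between consecutive weekly post numbers.
boundaries = [ANNOUNCEMENT] + WEEKLY_POST_IDS
weekly_counts = [later - earlier for earlier, later in zip(boundaries, boundaries[1:])]
changes = [100 * (count / baseline - 1) for count in weekly_counts]

for week, (count, change) in enumerate(zip(weekly_counts, changes), start=1):
    print(f"Week {week}: {count} posts ({change:+.2f}%)")
```

Computed this way, the eleventh week comes out at 177667 posts, ten fewer than the 177677 in the list, so either that count or post number 48410472 probably contains a typo. The percentage rounds to -49.78% either way.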
My old OP:
I was curious: Theymos announced the "Enhanced newbie restrictions & requirements" in post 45810047. This was on Sep 17 06:27 (Dutch time).
Post 45752895 was made 24 hours earlier, and post 45853867 was made 24 hours later.
Post 45705944 was made 48 hours earlier, post 45894771 was made 48 hours later.
Post 45469471 was made 1 week (plus 7 minutes) earlier, post 46096594 was made 1 week (plus 7 minutes) later.
Post 45113538 was made 2 weeks (plus 19 minutes) earlier, post 46377227 was made 2 weeks (plus 19 minutes) later.
Post 44746501 was made 3 weeks (plus 60 minutes) earlier, post 46637557 was made 3 weeks (plus 60 minutes) later.
That means:
- In 24 hours before the announcement, 57152 posts were made.
- In 24 hours after the announcement, 43820 posts were made.
- In 48 hours before the announcement, 104103 posts were made.
- In 48 hours after the announcement, 84724 posts were made.
- In 1 week (+7 minutes) before the announcement, 340576 posts were made.
- In 1 week (+7 minutes) after the announcement, 286547 posts were made.
- In 2 weeks (+19 minutes) before the announcement, 696509 posts were made.
- In 2 weeks (+19 minutes) after the announcement, 567180 posts were made.
- In 3 weeks (+60 minutes) before the announcement, 1063546 posts were made.
- In 3 weeks (+60 minutes) after the announcement, 827510 posts were made.
In 24 hours, that's 23.3% fewer posts. In 48 hours, that's 18.6% fewer posts. In 1 week, that's 15.9% fewer posts. In 2 weeks, that's 18.6% fewer posts. In 3 weeks, that's 22.2% fewer posts.
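For anyone who wants to check these percentages: each one compares the number of posts in a window before the announcement with the same-length window after it, using only the post numbers listed above. A minimal sketch (the names `WINDOWS` and `decrease_percent` are made up for this example):

```python
ANNOUNCEMENT = 45810047

# For each window: (post number that far before, post number that far after).
WINDOWS = {
    "24 hours": (45752895, 45853867),
    "48 hours": (45705944, 45894771),
    "1 week": (45469471, 46096594),
    "2 weeks": (45113538, 46377227),
    "3 weeks": (44746501, 46637557),
}


def decrease_percent(before_id, after_id, pivot=ANNOUNCEMENT):
    """Percentage drop in posts from the window before the pivot to the window after."""
    posts_before = pivot - before_id
    posts_after = after_id - pivot
    return 100 * (posts_before - posts_after) / posts_before


decreases = {label: round(decrease_percent(*ids), 1) for label, ids in WINDOWS.items()}
for label, drop in decreases.items():
    print(f"In {label}, that's {drop}% fewer posts.")
```

Because both windows have the same length (including the extra minutes), no correction for the odd few minutes is needed here.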
When I click "Show unread posts since last visit.", I can now see some actual real threads again! There is still ANN and BOUNTY pollution, but in between are real topics (I have no boards on ignore on this account).
Update: theymos posted data on the number of posts per day in the past 30 days.
Update: My list of new Merit receivers (including both abusers as well as innocent users): List of demoted Jr. Members who were ranked up again.
|
|
|
Your Activity dropped from 336 to 170, which means you've done some serious housekeeping. And you've dropped the bold font! I'm glad to see some people are willing to change their spam, so keep going like this ![Smiley](https://bitcointalk.org/Smileys/default/smiley.gif) Now all that's left to do is fix your quote above. It should look like this:
I have tried deleting them but couldn't.
Why not? Click here and click the delete button. Then open your post history and do the same to all other spam. Since you have 23 replies and 15 topics deleted by moderators, you would do the forum a favour if you clean up on your own.
Thank you for your time today, I have cleaned all past history, I'm now a new me, all my posts shall now be of original and authentic, also no form of spam involved. don't know were to send my apologizes but know my actions starting from today henceforth will speak for me. Thank you ones again. (Let's make bitcointalk great again)
Which means you need to add: [quote author=CryptopreneurBrainboss link=topic=5031673.msg45859731#msg45859731 date=1537257492] and: [/quote] at the right spot.
|
|
|
|