Bitcoin Forum
Author Topic: Stats you would like to see?  (Read 280 times)
Aero Blue (OP)
Full Member
Activity: 154
Merit: 128
August 09, 2019, 06:00:07 PM
Merited by DdmrDdmr (2), malevolent (1), LoyceV (1), hugeblack (1), tranthidung (1), morvillz7z (1), PrimeNumber7 (1)
#1

Currently I'm messing around with Python in order to scrape data for analysis later on. I know there are already a lot of people who have scrapers that will do this, but I'm looking for a more unique approach. So far the main thing I've been focusing on is "user stats" AKA gathering information from all their posts and plotting it.

Here are some examples of stats I am currently able to obtain (these are from my profile):



At this point I'm kind of stuck on what exactly to do next, and I'm sure a lot of you have some great ideas! Also, tell me your thoughts on the current stats, I'm always looking for ways to improve / different ways of displaying info.


My Legacy #NoPaidSig BTC: 14EQ2GvKbfkwU8JZ8PdWoAqDUWByzRAn4E
"With e-currency based on cryptographic proof, without the need to trust a third party middleman, money can be secure and transactions effortless." -- Satoshi
actmyname
Copper Member
Legendary
Activity: 2562
Merit: 2504
Spear the bees
August 09, 2019, 06:48:33 PM
#2

Since user stats have been disabled, it's good to see selective scraping.

An average of characters/post would be good. Same thing with posts/day. If we target users with typical spammer statistics, we can gather a list of them.
I've been reporting tons of spammers; however, many of the most egregious one-liner, burst-posting spam megathread posters have slipped through my fingers.
How about "average time between posts"?
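
A minimal sketch of how the three per-user numbers suggested here (average characters per post, posts per day, average time between posts) could be computed once the posts are scraped. The input shape, a list of `(timestamp, body)` tuples, and the function names are assumptions for illustration, not the OP's actual code. The timestamp format matches the one shown on this page.

```python
from datetime import datetime

# Timestamp format as displayed in this thread, e.g. "August 09, 2019, 06:48:33 PM".
FORUM_TIME_FORMAT = "%B %d, %Y, %I:%M:%S %p"


def parse_forum_time(text):
    """Convert a forum timestamp string into a datetime object."""
    return datetime.strptime(text, FORUM_TIME_FORMAT)


def summarize_posts(posts):
    """posts: list of (timestamp_string, body_text) tuples for one user (hypothetical input)."""
    if not posts:
        raise ValueError("need at least one post")
    times = sorted(parse_forum_time(stamp) for stamp, _ in posts)
    total_chars = sum(len(body) for _, body in posts)
    # Assumption: treat any posting history shorter than one day as one full day,
    # so a brand-new account does not get an inflated posts/day figure.
    span_days = max((times[-1] - times[0]).total_seconds() / 86400, 1.0)
    # Gaps between consecutive posts, in minutes.
    gaps = [(b - a).total_seconds() / 60 for a, b in zip(times, times[1:])]
    return {
        "avg_chars_per_post": total_chars / len(posts),
        "posts_per_day": len(posts) / span_days,
        "avg_minutes_between_posts": sum(gaps) / len(gaps) if gaps else None,
    }


# Made-up sample data, only to show the call.
sample = [
    ("August 09, 2019, 06:00:07 PM", "x" * 100),
    ("August 09, 2019, 06:30:07 PM", "x" * 50),
    ("August 10, 2019, 06:00:07 PM", "x" * 30),
]
stats = summarize_posts(sample)
print(stats)
```

Low characters per post combined with a high posts/day value is the kind of pattern this post describes as typical spammer statistics; any actual thresholds would have to be chosen by whoever runs the analysis.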

dkbit98
Legendary
Activity: 2226
Merit: 7129
August 09, 2019, 07:57:13 PM
#3

I would be interested to see which topics are most active for each Bitcointalk rank separately.

angel55
Full Member
Activity: 630
Merit: 172
August 09, 2019, 07:58:11 PM
#4

Can you find which user has the most posts without receiving a single merit? Don't count airdropped merits.
Aero Blue (OP)
Full Member
Activity: 154
Merit: 128
August 09, 2019, 10:15:54 PM
#5

Quote from: actmyname
Since user stats have been disabled, it's good to see selective scraping.

An average of characters/post would be good. Same thing with posts/day. If we target users with typical spammer statistics, we can gather a list of them.
I've been reporting tons of spammers; however, many of the most egregious one-liner, burst-posting spam megathread posters have slipped through my fingers.
How about "average time between posts"?

Yes, I already have the ability to do average char count. When you say posts/day, what timeframe would you like for that? I could do from when the account first posted but I'm assuming you would like a more recent value, maybe for the past month or the past week? I'm working on average time between posts right now, probably going to put the timeframe at about 1 week or so and then plot posts that are less than 30 minutes apart, etc.
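
The recent-window idea described above could look roughly like this. It is only a sketch under stated assumptions: the function name, the 7-day default, and the 30-minute threshold are placeholders taken from the figures mentioned in this post, and the timestamps are assumed to be parsed into `datetime` objects already.

```python
from datetime import datetime, timedelta


def recent_activity(times, now, window_days=7, burst_minutes=30):
    """times: list of datetime objects for one user's posts (hypothetical input).

    Returns posts/day over the last `window_days` days and the gaps
    (in minutes) between consecutive posts that are shorter than `burst_minutes`.
    """
    cutoff = now - timedelta(days=window_days)
    recent = sorted(t for t in times if cutoff <= t <= now)
    gaps = [(b - a).total_seconds() / 60 for a, b in zip(recent, recent[1:])]
    return {
        "posts_in_window": len(recent),
        "posts_per_day": len(recent) / window_days,
        "short_gaps_minutes": [g for g in gaps if g < burst_minutes],
    }


# Made-up sample data, only to show the call.
now = datetime(2019, 8, 10, 12, 0)
times = [
    datetime(2019, 8, 1, 9, 0),    # outside the 7-day window
    datetime(2019, 8, 5, 9, 0),
    datetime(2019, 8, 5, 9, 10),   # 10 minutes after the previous post
    datetime(2019, 8, 9, 20, 0),
    datetime(2019, 8, 9, 20, 45),  # 45 minutes after the previous post
]
result = recent_activity(times, now)
print(result)
```

The list of short gaps is what would be plotted; changing `window_days` to 30 gives the "past month" variant.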

Quote from: dkbit98
I would be interested to see which topics are most active for each Bitcointalk rank separately.

Quote from: angel55
Can you find which user has the most posts without receiving a single merit? Don't count airdropped merits.

I will not be doing merit-related things, as Loyce already has that covered. I am only interested in user-related stats (excluding merit), i.e. things I am able to derive from a post history or profile page.

philipma1957
Legendary
Activity: 4116
Merit: 7850
'The right to privacy matters'
August 09, 2019, 10:38:25 PM
#6

Check my posts out. The result should be:

- mining SHA-256
- marketplace
- altcoins
- meta

dkbit98
Legendary
Activity: 2226
Merit: 7129
August 09, 2019, 10:55:04 PM
Last edit: August 09, 2019, 11:15:17 PM by dkbit98
#7

Quote from: Aero Blue
I will not be doing merit-related things, as Loyce already has that covered. I am only interested in user-related stats (excluding merit), i.e. things I am able to derive from a post history or profile page.

I never asked for anything merit-related:

"I would be interested to see which topics are most active for each Bitcointalk rank separately."

Thanks

Aero Blue (OP)
Full Member
Activity: 154
Merit: 128
August 09, 2019, 11:56:01 PM
Last edit: August 10, 2019, 02:39:43 AM by Aero Blue
#8

Quote from: dkbit98
Quote from: Aero Blue
I will not be doing merit-related things, as Loyce already has that covered. I am only interested in user-related stats (excluding merit), i.e. things I am able to derive from a post history or profile page.

I never asked for anything merit-related:

"I would be interested to see which topics are most active for each Bitcointalk rank separately."

Thanks

Yes; however, you did ask for something related to "board statistics", meaning that I would have to scrape thousands of profiles, which is not "user specific". Maybe in the future I will be able to collect stats like that, but right now it's not feasible. Unless there is a way around what I've just described, I can't do it right now.

Quote from: philipma1957
Check my posts out. The result should be:

- mining SHA-256
- marketplace
- altcoins
- meta

Well, after crashing a few times due to your ridiculous 30k posts:


tranthidung
Legendary
Activity: 2268
Merit: 4009
Farewell o_e_l_e_o
August 10, 2019, 03:52:10 AM
#9

I would like to have (if you can):
- Median and interquartile range of merits per post in each board.
- Mean and standard deviation of merits per post in each board.
Without statistics, my guess is that the figures for serious boards are several times higher than for spam boards (like Bitcoin Discussion, Altcoin Discussion, etc.), but it would be interesting if you could retrieve those stats with your skills.
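
For reference, the four requested summary statistics can be computed with Python's standard `statistics` module once a list of (board, merits per post) pairs exists. This is a sketch only: the input shape, the function name, and the sample numbers are invented for illustration, and `statistics.quantiles` needs Python 3.8 or newer.

```python
import statistics
from collections import defaultdict


def merit_stats_by_board(rows):
    """rows: iterable of (board_name, merits_for_one_post) pairs (hypothetical input)."""
    by_board = defaultdict(list)
    for board, merits in rows:
        by_board[board].append(merits)

    out = {}
    for board, values in by_board.items():
        if len(values) >= 2:
            # Quartiles with the module's default ("exclusive") method.
            q1, _, q3 = statistics.quantiles(values, n=4)
            iqr = q3 - q1
            stdev = statistics.stdev(values)  # sample standard deviation
        else:
            # Spread is undefined for a single data point.
            iqr = stdev = None
        out[board] = {
            "median": statistics.median(values),
            "iqr": iqr,
            "mean": statistics.mean(values),
            "stdev": stdev,
        }
    return out


# Made-up sample values, not real forum data.
sample_rows = [("Meta", 0), ("Meta", 0), ("Meta", 1), ("Meta", 3), ("Mining", 2)]
board_stats = merit_stats_by_board(sample_rows)
print(board_stats)
```

Because most posts receive zero merits, the median and interquartile range will usually describe a board better than the mean, which a few heavily merited posts can pull upward.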
Here are some references for you (you might get some ideas from them):
- Time Series Analysis on Distributed Merits in the forum (daily, weekly, monthly)
- Time Series on monthly statistics of forum (new users, new topics, new posts)
- Assumed monthly statistics on registered accounts of bitcointalk.org (2009-2019)
- Observation on interquartile range of intra-day merits with time series plot
- Some stats of forum in the WO thread (Oct. 2017 - Jul. 2019) Monthly update
- Bitcointalk Merit Dashboard: https://public.tableau.com/profile/ddmrddmr#!/

o_e_l_e_o
In memoriam
Legendary
Activity: 2268
Merit: 18509
August 10, 2019, 07:29:06 AM
#10

Quote from: angel55
Can you find which user has the most posts without receiving a single merit? Don't count airdropped merits.

You can find this pretty easily by using the "Most Posts" list on Vod's BPIP here: https://bpip.org/report.aspx?r=mostposts

The account with the most posts but no merit is ChartBuddy at 21804 posts. It is, however, a bot account that hasn't made a single post since the introduction of the merit system.

The highest-posting non-bot account with no merit is notlist3d at 15110 posts, but again, they've only made 3 posts since the introduction of merit.
LoyceV
Legendary
Activity: 3304
Merit: 16599
Thick-Skinned Gang Leader and Golden Feather 2021
August 10, 2019, 08:47:12 AM
#11

Quote from: actmyname
How about "average time between posts"?
The average will just be one number (based on the total number of posts since registration). A "burst post" graph could be interesting to show the distribution of time between posts. Say:
-number of posts within less than 2 minutes: x
-number of posts within 2-5 minutes: y
-5-10 minutes: z
-10-30 minutes: a
-30-120 minutes: b
-120-720 minutes: c

You catch my drift :)
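
The binning suggested in this post could be sketched as follows. The bin edges come from the list above; the extra "720+" overflow bin, the function name, and the input shape (a list of `datetime` objects) are assumptions added for illustration.

```python
from bisect import bisect_right
from datetime import datetime

# Upper bin edges in minutes, taken from the list in this post.
BIN_EDGES = [2, 5, 10, 30, 120, 720]
# Assumption: gaps of 720 minutes or more go into an extra overflow bin.
BIN_LABELS = ["<2", "2-5", "5-10", "10-30", "30-120", "120-720", "720+"]


def burst_histogram(times):
    """Count the gaps between consecutive posts per time bin.

    times: list of datetime objects for one user's posts (hypothetical input).
    """
    ordered = sorted(times)
    counts = dict.fromkeys(BIN_LABELS, 0)
    for earlier, later in zip(ordered, ordered[1:]):
        gap_minutes = (later - earlier).total_seconds() / 60
        # bisect_right puts a gap equal to an edge into the next bin,
        # so exactly 2 minutes counts as "2-5".
        counts[BIN_LABELS[bisect_right(BIN_EDGES, gap_minutes)]] += 1
    return counts


# Made-up sample data, only to show the call.
sample_times = [
    datetime(2019, 8, 9, 12, 0),
    datetime(2019, 8, 9, 12, 1),   # gap of 1 minute
    datetime(2019, 8, 9, 12, 4),   # gap of 3 minutes
    datetime(2019, 8, 9, 12, 20),  # gap of 16 minutes
    datetime(2019, 8, 9, 15, 0),   # gap of 160 minutes
    datetime(2019, 8, 10, 15, 0),  # gap of 1440 minutes
]
hist = burst_histogram(sample_times)
print(hist)
```

The resulting dictionary maps directly onto a bar chart with one bar per bin.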

Aero Blue (OP)
Full Member
Activity: 154
Merit: 128
August 10, 2019, 08:24:59 PM
Merited by LoyceV (1)
#12

Quote from: LoyceV
Quote from: actmyname
How about "average time between posts"?
The average will just be one number (based on the total number of posts since registration). A "burst post" graph could be interesting to show the distribution of time between posts. Say:
-number of posts within less than 2 minutes: x
-number of posts within 2-5 minutes: y
-5-10 minutes: z
-10-30 minutes: a
-30-120 minutes: b
-120-720 minutes: c

You catch my drift :)

Here is an example of what I could come up with:



It gets super complex really quickly when talking about distributions. This is the best I can do, as my knowledge of stats is limited. Hopefully I can figure out something that looks better, but this is at least something to look at.

Edit: Added legend so it's a bit more readable.

PrimeNumber7
Copper Member
Legendary
Activity: 1624
Merit: 1899
Amazon Prime Member #7
August 10, 2019, 09:19:31 PM
#13

You can try to measure a person's interest in the threads they post in. You can measure this as the percentage of threads that person has posted in where they made exactly one post, two or more posts, and five or more posts.
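
The percentages described here could be computed along these lines. This is a sketch that assumes the scraper yields one thread identifier per post; the function name and sample data are invented for illustration.

```python
from collections import Counter


def thread_participation(thread_ids):
    """thread_ids: one entry per post, naming the thread it was made in (hypothetical input).

    Returns the percentage of distinct threads in which the user made
    exactly one post, two or more posts, and five or more posts.
    """
    posts_per_thread = Counter(thread_ids)
    total_threads = len(posts_per_thread)
    if total_threads == 0:
        return {}

    def percent(predicate):
        matching = sum(1 for n in posts_per_thread.values() if predicate(n))
        return 100 * matching / total_threads

    return {
        "exactly_one": percent(lambda n: n == 1),
        "two_or_more": percent(lambda n: n >= 2),
        "five_or_more": percent(lambda n: n >= 5),
    }


# Made-up sample: 1 post in thread "a", 2 in "b", 5 in "c", 1 in "d".
sample_posts = ["a", "b", "b", "c", "c", "c", "c", "c", "d"]
shares = thread_participation(sample_posts)
print(shares)
```

Note that the "two or more" and "five or more" groups overlap by design, so the three percentages do not add up to 100.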

Depending on your skill level, you can also scrape each thread a person has posted in, and count the number of times they were quoted, and the number of times they subsequently posted after being quoted. Or you could measure the number of times a person posted in a thread, at least one post by someone else was made, and the person posted in the thread a subsequent time. Both of these should measure engagement.
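
The second engagement measure in this paragraph (the person posts, at least one other person posts, then the person posts again) could be counted as in the sketch below. It assumes each scraped thread is available as a chronological list of author names; the function name and the sample thread are hypothetical.

```python
def count_returns(authors, user):
    """Count how often `user` came back to a thread after someone else posted.

    authors: post authors of one thread in chronological order (hypothetical input).
    """
    returns = 0
    user_has_posted = False
    other_posted_since = False
    for author in authors:
        if author == user:
            if user_has_posted and other_posted_since:
                returns += 1
            user_has_posted = True
            other_posted_since = False
        elif user_has_posted:
            # Someone else posted after the user's most recent post.
            other_posted_since = True
    return returns


# Made-up sample thread, only to show the call.
thread = ["alice", "bob", "alice", "alice", "carol", "dave", "alice"]
print(count_returns(thread, "alice"))
```

Consecutive posts by the same user are not counted as a return, since nobody else posted in between.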

You could also measure how many times a person is quoted in a thread after they post in a thread. This should measure how interesting their posts are.
tranthidung
Legendary
Activity: 2268
Merit: 4009
Farewell o_e_l_e_o
August 11, 2019, 02:30:08 AM
#14


Looks nice! Can you get some of my statistics, please?
- Average posts per day
- Average merits per merited post
It would be nice if you could make plots and share the dataset behind them (then I can play around with my own data).
Thanks in advance, fella.
