Bitcoin Forum
April 20, 2024, 02:23:34 AM *
News: Latest Bitcoin Core release: 26.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: [1]
  Print  
Author Topic: Hardware requirements for analyzing ouput of rusty-blockparser  (Read 74 times)
eugx0 (OP)
Newbie
*
Offline Offline

Activity: 1
Merit: 0


View Profile
June 22, 2023, 12:09:46 PM
 #1

Hello,

I am trying to do some data analytics on the bitcoin blockchain. I am running a node and then parsing the .dat files using the rusty-blockparser.
This produces four .csv files which the biggest one is nearly 100 GB of data. Due to the size,  I can't load it on a python jupyter notebook. I am wondering what common programs and strategies that people use to process this data.

Thanks for the help,

Best,
"With e-currency based on cryptographic proof, without the need to trust a third party middleman, money can be secure and transactions effortless." -- Satoshi
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
1713579814
Hero Member
*
Offline Offline

Posts: 1713579814

View Profile Personal Message (Offline)

Ignore
1713579814
Reply with quote  #2

1713579814
Report to moderator
1713579814
Hero Member
*
Offline Offline

Posts: 1713579814

View Profile Personal Message (Offline)

Ignore
1713579814
Reply with quote  #2

1713579814
Report to moderator
1713579814
Hero Member
*
Offline Offline

Posts: 1713579814

View Profile Personal Message (Offline)

Ignore
1713579814
Reply with quote  #2

1713579814
Report to moderator
LoyceV
Legendary
*
Offline Offline

Activity: 3290
Merit: 16498


Thick-Skinned Gang Leader and Golden Feather 2021


View Profile WWW
June 23, 2023, 10:48:49 AM
Merited by ABCbits (1)
 #2

This produces four .csv files which the biggest one is nearly 100 GB of data. Due to the size,  I can't load it on a python jupyter notebook.
You can read the data into some database that does what you need, or use things like grep and split to get only the lines you need.

Quote
I am wondering what common programs and strategies that people use to process this data.
Can you start by sharing what you're trying to accomplish? I usually pipe the data through some Linux command line tools to get what I need, you'll find some examples in those topics:

█▀▀▀











█▄▄▄
▀▀▀▀▀▀▀▀▀▀▀
e
▄▄▄▄▄▄▄▄▄▄▄
█████████████
████████████▄███
██▐███████▄█████▀
█████████▄████▀
███▐████▄███▀
████▐██████▀
█████▀█████
███████████▄
████████████▄
██▄█████▀█████▄
▄█████████▀█████▀
███████████▀██▀
████▀█████████
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
c.h.
▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄
▀▀▀█











▄▄▄█
▄██████▄▄▄
█████████████▄▄
███████████████
███████████████
███████████████
███████████████
███░░█████████
███▌▐█████████
█████████████
███████████▀
██████████▀
████████▀
▀██▀▀
DaveF
Legendary
*
Offline Offline

Activity: 3458
Merit: 6209


Crypto Swap Exchange


View Profile WWW
June 23, 2023, 11:02:38 AM
 #3

Jupyter Notebook is also bloaty software.

Along with what ETF and Loyce said. Use the proper tool for the job, there are better ways to get at the data. I'm assuming (yes I know don't assume) you are a student and have to use Jupyter for class so in the end it has to wind up in there. But, you can't just do A->B you need to get it to a size and format that it can handle.

-Dave


█▀▀▀











█▄▄▄
▀▀▀▀▀▀▀▀▀▀▀
e
▄▄▄▄▄▄▄▄▄▄▄
█████████████
████████████▄███
██▐███████▄█████▀
█████████▄████▀
███▐████▄███▀
████▐██████▀
█████▀█████
███████████▄
████████████▄
██▄█████▀█████▄
▄█████████▀█████▀
███████████▀██▀
████▀█████████
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
c.h.
▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄
▀▀▀█











▄▄▄█
▄██████▄▄▄
█████████████▄▄
███████████████
███████████████
███████████████
███████████████
███░░█████████
███▌▐█████████
█████████████
███████████▀
██████████▀
████████▀
▀██▀▀
Pages: [1]
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!