Bitcoin Forum
May 03, 2024, 04:10:57 AM *
News: Latest Bitcoin Core release: 27.0 [Torrent]
 
   Home   Help Search Login Register More  
Pages: [1]
  Print  
Author Topic: Hardware requirements for analyzing ouput of rusty-blockparser  (Read 76 times)
eugx0 (OP)
Newbie
*
Offline Offline

Activity: 1
Merit: 0


View Profile
June 22, 2023, 12:09:46 PM
 #1

Hello,

I am trying to do some data analytics on the bitcoin blockchain. I am running a node and then parsing the .dat files using the rusty-blockparser.
This produces four .csv files which the biggest one is nearly 100 GB of data. Due to the size,  I can't load it on a python jupyter notebook. I am wondering what common programs and strategies that people use to process this data.

Thanks for the help,

Best,
1714709457
Hero Member
*
Offline Offline

Posts: 1714709457

View Profile Personal Message (Offline)

Ignore
1714709457
Reply with quote  #2

1714709457
Report to moderator
BitcoinCleanup.com: Learn why Bitcoin isn't bad for the environment
Advertised sites are not endorsed by the Bitcoin Forum. They may be unsafe, untrustworthy, or illegal in your jurisdiction.
LoyceV
Legendary
*
Offline Offline

Activity: 3304
Merit: 16585


Thick-Skinned Gang Leader and Golden Feather 2021


View Profile WWW
June 23, 2023, 10:48:49 AM
Merited by ABCbits (1)
 #2

This produces four .csv files which the biggest one is nearly 100 GB of data. Due to the size,  I can't load it on a python jupyter notebook.
You can read the data into some database that does what you need, or use things like grep and split to get only the lines you need.

Quote
I am wondering what common programs and strategies that people use to process this data.
Can you start by sharing what you're trying to accomplish? I usually pipe the data through some Linux command line tools to get what I need, you'll find some examples in those topics:

DaveF
Legendary
*
Offline Offline

Activity: 3472
Merit: 6259


Crypto Swap Exchange


View Profile WWW
June 23, 2023, 11:02:38 AM
 #3

Jupyter Notebook is also bloaty software.

Along with what ETF and Loyce said. Use the proper tool for the job, there are better ways to get at the data. I'm assuming (yes I know don't assume) you are a student and have to use Jupyter for class so in the end it has to wind up in there. But, you can't just do A->B you need to get it to a size and format that it can handle.

-Dave


█▀▀▀











█▄▄▄
▀▀▀▀▀▀▀▀▀▀▀
e
▄▄▄▄▄▄▄▄▄▄▄
█████████████
████████████▄███
██▐███████▄█████▀
█████████▄████▀
███▐████▄███▀
████▐██████▀
█████▀█████
███████████▄
████████████▄
██▄█████▀█████▄
▄█████████▀█████▀
███████████▀██▀
████▀█████████
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
c.h.
▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄▄
▀▀▀█











▄▄▄█
▄██████▄▄▄
█████████████▄▄
███████████████
███████████████
███████████████
███████████████
███░░█████████
███▌▐█████████
█████████████
███████████▀
██████████▀
████████▀
▀██▀▀
Pages: [1]
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.19 | SMF © 2006-2009, Simple Machines Valid XHTML 1.0! Valid CSS!