BitPolito/bitcoin-data-analysis: A Python library for analyzing Bitcoin on-chain and Lightning Network data, including providers to fetch data and functions for analysis.
josibake/bitcoin-data-analysis: A repository with notebooks and scripts for analyzing Bitcoin data, often leveraging public datasets like Google's BigQuery.
A tool for extracting data from Bitcoin-like blockchains into a relational database model.
jnewbery/bitcointools: Tools for working with bitcoind data files (like mempool.dat, peers.dat, and wallet.dat).
3. Curated Bitcoin Datasets (Often Linked from GitHub)
While the full Bitcoin blockchain (over 660 GB at the time of writing) is too large to host directly on GitHub, you'll find repositories that:
Provide smaller, curated datasets: E.g., lists of mining pools, non-standard transactions, or block arrival times.
Offer tools to access public datasets: Code examples or interfaces for querying large Bitcoin datasets hosted on cloud platforms such as Google Cloud's BigQuery. Google hosts a continuously updated public dataset of the Bitcoin blockchain, and many GitHub projects demonstrate how to query it using SQL.
Store metadata: Repositories might back up GitHub issues and pull requests related to Bitcoin development.
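As a rough illustration of the BigQuery approach mentioned above, here is a minimal Python sketch that counts Bitcoin transactions per day. It assumes the `google-cloud-bigquery` package is installed and that Google Cloud credentials are already configured in your environment. The table name `bigquery-public-data.crypto_bitcoin.transactions` is the public dataset's transactions table; the function names `build_daily_tx_count_query` and `daily_tx_counts` are hypothetical helpers made up for this example, not part of any of the repositories listed here.

```python
from datetime import date

# Public BigQuery table holding Bitcoin transactions.
TRANSACTIONS_TABLE = "bigquery-public-data.crypto_bitcoin.transactions"


def build_daily_tx_count_query(table: str = TRANSACTIONS_TABLE) -> str:
    """Return SQL that counts transactions per day in [@start, @end).

    @start and @end are named query parameters of type DATE.
    """
    return f"""
        SELECT DATE(block_timestamp) AS day, COUNT(*) AS tx_count
        FROM `{table}`
        WHERE block_timestamp >= TIMESTAMP(@start)
          AND block_timestamp < TIMESTAMP(@end)
        GROUP BY day
        ORDER BY day
    """


def daily_tx_counts(start: date, end: date):
    """Run the query and return a list of (day, tx_count) tuples.

    Hypothetical helper: needs google-cloud-bigquery and valid credentials.
    """
    if start >= end:
        raise ValueError("start must be earlier than end")

    # Imported lazily so the query builder works without the library.
    from google.cloud import bigquery

    client = bigquery.Client()
    job_config = bigquery.QueryJobConfig(
        query_parameters=[
            bigquery.ScalarQueryParameter("start", "DATE", start),
            bigquery.ScalarQueryParameter("end", "DATE", end),
        ]
    )
    rows = client.query(build_daily_tx_count_query(), job_config=job_config).result()
    return [(row["day"], row["tx_count"]) for row in rows]
```

A call such as `daily_tx_counts(date(2024, 1, 1), date(2024, 1, 8))` would return one row per day for that week. Keeping the date range narrow is a sensible default, since BigQuery bills on-demand queries by the amount of data scanned.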
Example of an organization that curates Bitcoin-related datasets on GitHub:
bitcoin-data organization