Skip to main content
construction release_alert
Scholars@Duke will be down for maintenance for approximately one hour starting Tuesday, 11/11 @1pm ET
cancel

A Data Engineering Framework for Ethereum Beacon Chain Rewards: From Data Collection to Decentralization Metrics.

Publication ,  Journal Article
Yan, T; Li, S; Kraner, B; Zhang, L; Tessone, CJ
Published in: Scientific data
March 2025

Ethereum, one of the leading smart contract blockchain platforms, currently operates on a Proof-of-Stake (PoS) consensus mechanism designed to secure the network while incentivizing desired validator behaviors. Despite blockchain technology's promise of decentralization, limitations and gaps in decentralization persist, posing challenges for analysis and optimization. This study introduces a comprehensive dataset of validator rewards from the Ethereum Beacon chain, categorized into attestation, proposer, and sync committee rewards. By providing granular, transparent, and auditable records of validator activities, the dataset addresses the fragmentation of raw blockchain data and enables robust evaluations of PoS incentive structures. Researchers can leverage this dataset to assess enforceable rules, verify protocol compliance, and analyze long-term validator behavior. In addition, we apply decentralization metrics such as the Shannon entropy, Gini Index, Nakamoto Coefficient, and Herfindahl-Hirschman Index (HHI) to showcase the dataset's utility in studying decentralization trends. Publicly available on Harvard Dataverse and accompanied by open-source analytical tools on GitHub, this dataset facilitates future research aimed at enhancing blockchain systems' decentralization, security, and efficiency.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Scientific data

DOI

EISSN

2052-4463

ISSN

2052-4463

Publication Date

March 2025

Volume

12

Issue

1

Start / End Page

519

Related Subject Headings

  • Data Collection
  • Computer Security
  • Blockchain
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Yan, T., Li, S., Kraner, B., Zhang, L., & Tessone, C. J. (2025). A Data Engineering Framework for Ethereum Beacon Chain Rewards: From Data Collection to Decentralization Metrics. Scientific Data, 12(1), 519. https://doi.org/10.1038/s41597-025-04623-7
Yan, Tao, Shengnan Li, Benjamin Kraner, Luyao Zhang, and Claudio J. Tessone. “A Data Engineering Framework for Ethereum Beacon Chain Rewards: From Data Collection to Decentralization Metrics.Scientific Data 12, no. 1 (March 2025): 519. https://doi.org/10.1038/s41597-025-04623-7.
Yan T, Li S, Kraner B, Zhang L, Tessone CJ. A Data Engineering Framework for Ethereum Beacon Chain Rewards: From Data Collection to Decentralization Metrics. Scientific data. 2025 Mar;12(1):519.
Yan, Tao, et al. “A Data Engineering Framework for Ethereum Beacon Chain Rewards: From Data Collection to Decentralization Metrics.Scientific Data, vol. 12, no. 1, Mar. 2025, p. 519. Epmc, doi:10.1038/s41597-025-04623-7.
Yan T, Li S, Kraner B, Zhang L, Tessone CJ. A Data Engineering Framework for Ethereum Beacon Chain Rewards: From Data Collection to Decentralization Metrics. Scientific data. 2025 Mar;12(1):519.

Published In

Scientific data

DOI

EISSN

2052-4463

ISSN

2052-4463

Publication Date

March 2025

Volume

12

Issue

1

Start / End Page

519

Related Subject Headings

  • Data Collection
  • Computer Security
  • Blockchain