Data transfer and metadata replication
Purpose¶
- Find the missing NetCDF files on ORNL data node and transfer them
- Replicate all the data on LLNL data nodes to the ORNL data node
Terms¶
- LLNL metadata: any metadata in which the value of
data_nodeis one of esgf-data1.llnl.gov, esgf-data2.llnl.gov, and aims3.llnl.gov - ORNL metadata: any metadata in which the value of
data_nodeis esgf-node.ornl.gov
Two phases of data transfer¶
-
Phase 1: Retrieve all data paths from the ORNL metadata URLs, verify their existence on the ORNL data node, and ensure they have non-zero sizes. For any missing or zero-size files, locate and transfer them from the LLNL and ANL data nodes.
- Missing files: 557,560
- Successfully transferred/found: 547,123
- Paths requiring fix: 46,500
- Files unavailable at DOE sites: 10,432
-
Phase 2: Identify all LLNL metadata entries missing in ORNL, extract their data paths, and transfer the corresponding files from LLNL to ORNL.
- Successfully transferred/found: 158,796(CMIP6), 1588(input4MIPs), 5533(DRCDP), 9(CMIP6Plus):
- Files unavailable at DOE sites: 170