landing zone · journey tracker
data
Bulk-data intake surface. Tracks where data lives across cloud accounts, RAID, file backups, the old iMac, and unlabeled drives — and the position of each through a 5-gate journey to its canonical home. The RAID is the home for sorted masters; data.gf.cx is the registry of what hasn't landed there yet.
Welcome back. You're at the start of the journey.
All 18 items sourced across 28–36 TB.
A · SOURCED
here
B · LANDED
C · CATALOGED
D · SORTED
E · ARCHIVED
The journey · 5 stage-gates
|
GATE A open ›
SOURCED
identified at source · requested · ready to pull
18
items · ~23.1 TB
|
sub-routines · operational playbook
Google Takeout
takeout.google.com → 'Add to Drive' delivery (resumable via rclone)
Google Workspace
admin.google.com → Data Export (admin role required)
Apple iCloud
privacy.apple.com → request archive (2–7 day prep, 14-day download window)
Old HDs
mount via Thunderbolt dock → diskutil list → ddrescue if flaky
RAID + backups
rclone ls + treesize → catalog pre-dedup baseline
|
|
GATE B open ›
LANDED
physical copy on intake spindle
0
items · —
|
sub-routines · operational playbook
Bulk pull
rclone copy --transfers=4 --checksum --retries=10
Verify integrity
md5sum / sha256sum, write checksums.txt before deleting source
Landing target
external G-Drive (cloud chunks) OR RAID intake partition (>5 TB items)
Bandwidth budget
schedule overnight pulls; 100 Mbps = ~1 TB / 24 hr
|
|
GATE C open ›
CATALOGED
contents inventoried · dedup done
0
items · —
|
sub-routines · operational playbook
Dedup
jdupes -r (cloud) · rmlint (RAID/backups stack of duplicates)
Photo metadata
exiftool -recursive → write csv sidecar
Audio dedup
fpcalc chromaprint → cluster near-duplicates
Forensic catalog
fdupes + file-type histogram (legacy HDs of unknown content)
Filename hygiene
detox -r / convmv -f cp1252 (for Old iMac drives from pre-UTF8 era)
|
|
GATE D open ›
SORTED
moved to canonical home
0
items · —
|
sub-routines · operational playbook
Photos → RAID
rsync → /RAID/photos/
Email → RAID
rsync → /RAID/email/
Docs → pa.gf.cx
OCR + retag → pa.gf.cx purchase/asset/service records
Derivatives → R2
~1500px JPG + 1080p MP4 + thumb + sidecar JSON (per derivatives-in-R2 pattern)
Manifest update
set target_home in migrations.yml, re-bake
|
|
GATE E open ›
ARCHIVED
intake record retained · source can be wiped
0
items · —
|
sub-routines · operational playbook
Source decommission
Old HDs: diskutil secureErase 1 (single-pass zero, sufficient for spinners)
Cloud account
Optional: delete Google Takeout source archive after RAID checksum verified
Intake record
set state: archived in migrations.yml; record retained for audit trail
Backup posture
RAID + R2 derivatives + offsite cold copy (3-2-1)
|
Left column = gate identity + tally · Right column = the playbook of operations performed at that gate. Items move top-to-bottom through the gates.
Urgent · hard deadlines
⚠ urgent · 3 days left
Google Takeout · dare.co.uk
| Size | 581 GB |
| Deadline | 2026-06-01 · hard stop |
| Currently | Google servers |
| Target | external G-Drive → TBD |
| Gate | SOURCED |
First inbound item. 3 days from log to expire. Worth verifying delivery mode (direct-link vs Add-to-Drive) before pulling - Drive mode is resumable, direct-link is not.
Three intake categories
|
Cloud accounts
10–11 TB · 8 sources
|
8 distinct sources · all named 2026-05-29 · hard deadlines once requested
|
8
items
|
|
RAID + file backups
10–15 TB
|
duplicated several times · needs dedup before sort
|
1
items
|
|
Old iMac + old HDs
8–10 TB · 8 sources
|
random + unknown contents · forensic catalog needed
|
9
items
|
Stack status
| ● | G-Drive intake spindle | assumed physical landing; capacity TBD |
| ● | RAID home spindle | load-bearing per derivatives-in-R2/originals-on-RAID pattern |
| ● | migrations.yml manifest | schema v0, populated 2026-05-29 |
| ● | Time Machine network share | not configured |
| ● | Dan's Mac -> data.gf.cx | backup target not pointed |
| ● | Audrey's Mac -> data.gf.cx | backup target not pointed |
Items by gate
GATE A SOURCED — identified at source · requested · ready to pull 18 items
| Google Takeout · dare.co.uk | cloud | 581 GB | ready | ⚠ 2026-06-01 (3d) |
| Google Takeout · smart.sellars | cloud | 1.5 TB | pending | |
| Google Takeout · Cindy Gmail | cloud | 500 GB | pending | |
| Google Workspace export · Audrey Workspace · audreyinc | cloud | 2.0 TB | pending | |
| Google Workspace export · Audrey Workspace · audreylam | cloud | 2.0 TB | pending | |
| Apple Privacy export (iCloud) · Audrey iCloud · audreyinc | cloud | 1.0 TB | pending | |
| Apple Privacy export (iCloud) · Audrey iCloud · audreylam | cloud | 1.0 TB | pending | |
| Apple Privacy export (iCloud) · Family iCloud · dan@dareusa.com | cloud | 2.0 TB | pending | |
| existing RAID + file backups | raid_backups | 10–15 TB | pending | |
| Old iMac | legacy_hds | size TBD | pending | |
| Old HD (1 of 8) — unlabeled | legacy_hds | size TBD | pending | |
| Old HD (2 of 8) — unlabeled | legacy_hds | size TBD | pending | |
| Old HD (3 of 8) — unlabeled | legacy_hds | size TBD | pending | |
| Old HD (4 of 8) — unlabeled | legacy_hds | size TBD | pending | |
| Old HD (5 of 8) — unlabeled | legacy_hds | size TBD | pending | |
| Old HD (6 of 8) — unlabeled | legacy_hds | size TBD | pending | |
| Old HD (7 of 8) — unlabeled | legacy_hds | size TBD | pending | |
| Old HD (8 of 8) — unlabeled | legacy_hds | size TBD | pending |
data.gf.cx · baked 2026-05-29 from
migrations.yml ·
18 items · Variant B · vertical + sub-routines · schema v0