landing zone · journey tracker

data

Bulk-data intake surface. Tracks where data lives across cloud accounts, RAID, file backups, the old iMac, and unlabeled drives — and the position of each through a 5-gate journey to its canonical home. The RAID is the home for sorted masters; data.gf.cx is the registry of what hasn't landed there yet.

Welcome back. You're at the start of the journey.
All 18 items sourced across 28–36 TB.
A · SOURCED
here
B · LANDED
C · CATALOGED
D · SORTED
E · ARCHIVED
⚠ next Google Takeout · dare.co.uk · 581 GB · 3d left Gate A →

The journey · 5 stage-gates

GATE A open ›
SOURCED
identified at source · requested · ready to pull
18
items · ~23.1 TB
sub-routines · operational playbook
Google Takeout
takeout.google.com → 'Add to Drive' delivery (resumable via rclone)
Google Workspace
admin.google.com → Data Export (admin role required)
Apple iCloud
privacy.apple.com → request archive (2–7 day prep, 14-day download window)
Old HDs
mount via Thunderbolt dock → diskutil list → ddrescue if flaky
RAID + backups
rclone ls + treesize → catalog pre-dedup baseline
GATE B open ›
LANDED
physical copy on intake spindle
0
items · —
sub-routines · operational playbook
Bulk pull
rclone copy --transfers=4 --checksum --retries=10
Verify integrity
md5sum / sha256sum, write checksums.txt before deleting source
Landing target
external G-Drive (cloud chunks) OR RAID intake partition (>5 TB items)
Bandwidth budget
schedule overnight pulls; 100 Mbps = ~1 TB / 24 hr
GATE C open ›
CATALOGED
contents inventoried · dedup done
0
items · —
sub-routines · operational playbook
Dedup
jdupes -r (cloud) · rmlint (RAID/backups stack of duplicates)
Photo metadata
exiftool -recursive → write csv sidecar
Audio dedup
fpcalc chromaprint → cluster near-duplicates
Forensic catalog
fdupes + file-type histogram (legacy HDs of unknown content)
Filename hygiene
detox -r / convmv -f cp1252 (for Old iMac drives from pre-UTF8 era)
GATE D open ›
SORTED
moved to canonical home
0
items · —
sub-routines · operational playbook
Photos → RAID
rsync → /RAID/photos/// (then Immich indexes)
Email → RAID
rsync → /RAID/email// (mbox/maildir)
Docs → pa.gf.cx
OCR + retag → pa.gf.cx purchase/asset/service records
Derivatives → R2
~1500px JPG + 1080p MP4 + thumb + sidecar JSON (per derivatives-in-R2 pattern)
Manifest update
set target_home in migrations.yml, re-bake
GATE E open ›
ARCHIVED
intake record retained · source can be wiped
0
items · —
sub-routines · operational playbook
Source decommission
Old HDs: diskutil secureErase 1 (single-pass zero, sufficient for spinners)
Cloud account
Optional: delete Google Takeout source archive after RAID checksum verified
Intake record
set state: archived in migrations.yml; record retained for audit trail
Backup posture
RAID + R2 derivatives + offsite cold copy (3-2-1)

Left column = gate identity + tally · Right column = the playbook of operations performed at that gate. Items move top-to-bottom through the gates.

Urgent · hard deadlines

⚠ urgent · 3 days left
Google Takeout · dare.co.uk
Size 581 GB
Deadline 2026-06-01 · hard stop
Currently Google servers
Target external G-Drive → TBD
Gate SOURCED

First inbound item. 3 days from log to expire. Worth verifying delivery mode (direct-link vs Add-to-Drive) before pulling - Drive mode is resumable, direct-link is not.

Three intake categories

Cloud accounts
10–11 TB · 8 sources
8 distinct sources · all named 2026-05-29 · hard deadlines once requested
8
items
RAID + file backups
10–15 TB
duplicated several times · needs dedup before sort
1
items
Old iMac + old HDs
8–10 TB · 8 sources
random + unknown contents · forensic catalog needed
9
items

Stack status

G-Drive intake spindle assumed physical landing; capacity TBD
RAID home spindle load-bearing per derivatives-in-R2/originals-on-RAID pattern
migrations.yml manifest schema v0, populated 2026-05-29
Time Machine network share not configured
Dan's Mac -> data.gf.cx backup target not pointed
Audrey's Mac -> data.gf.cx backup target not pointed

Items by gate

GATE A   SOURCED — identified at source · requested · ready to pull 18 items
Google Takeout · dare.co.uk cloud 581 GB ready ⚠ 2026-06-01 (3d)
Google Takeout · smart.sellars cloud 1.5 TB pending
Google Takeout · Cindy Gmail cloud 500 GB pending
Google Workspace export · Audrey Workspace · audreyinc cloud 2.0 TB pending
Google Workspace export · Audrey Workspace · audreylam cloud 2.0 TB pending
Apple Privacy export (iCloud) · Audrey iCloud · audreyinc cloud 1.0 TB pending
Apple Privacy export (iCloud) · Audrey iCloud · audreylam cloud 1.0 TB pending
Apple Privacy export (iCloud) · Family iCloud · dan@dareusa.com cloud 2.0 TB pending
existing RAID + file backups raid_backups 10–15 TB pending
Old iMac legacy_hds size TBD pending
Old HD (1 of 8) — unlabeled legacy_hds size TBD pending
Old HD (2 of 8) — unlabeled legacy_hds size TBD pending
Old HD (3 of 8) — unlabeled legacy_hds size TBD pending
Old HD (4 of 8) — unlabeled legacy_hds size TBD pending
Old HD (5 of 8) — unlabeled legacy_hds size TBD pending
Old HD (6 of 8) — unlabeled legacy_hds size TBD pending
Old HD (7 of 8) — unlabeled legacy_hds size TBD pending
Old HD (8 of 8) — unlabeled legacy_hds size TBD pending
data.gf.cx · baked 2026-05-29 from migrations.yml · 18 items · Variant B · vertical + sub-routines · schema v0