Full Hack Report — Amazon × MIT dataset · 898,415 stops · 6,112 routes · 5 U.S. metros
Scroll for the complete analysis, or use the sidebar to jump to a section
Hack Report · May 2026
Dashboard
Project overview, dataset context, and key performance metrics from the cleaned Amazon last-mile data.
Project overview
Last-mile delivery accounts for ~53% of total shipping costs. This analysis uses real Amazon delivery data (2018) across five US metros to improve routing efficiency, reduce operational cost, and lower emissions.
Dataset
Amazon × MIT Last Mile Routing Challenge — GPS coordinates, route records, stop-level delivery details
~23.2 kg more CO₂ per route than LA. Routing improvements alone can cut this without new EVs.
Chicago — 59.6 kg/route
Second-highest emissions, aligned with long average route distance (283.9 km).
Optimizer
What-if calculator — estimate savings if Boston routes matched Los Angeles efficiency.
Optimization simulator
What-if savings calculator
If Boston routes matched LA efficiency, how much would be saved?
Interactive
Route optimization level: 50%
Distance saved per route
55.3 km
out of 110.5 km excess
CO₂ saved per route
11.6 kg
out of 23.2 kg excess
Fleet annual CO₂ savings
70.9K kg
across ~6,112 Boston routes/yr
Clustering
K-Means zone identification, silhouette validation, and delivery territory analysis.
K-Means analysis
K-Means silhouette score
Peaks at K=5 with score 0.96 — optimal clusters
Clustering
Silhouette scoreOptimal (K=5)
Elbow method — inertia
Steep drop K=2→5, then flattens — diminishing returns beyond 5 clusters
K-Means
Clustering validation (report)
Both methods confirm optimal K = 5, matching the five metro regions
Elbow method
Inertia drops sharply between K=2 and K=5, then improvement flattens significantly.
Silhouette score
Peaks at 0.96 for K=5 — exceptionally high for real-world geospatial clustering. Declines sharply beyond K=5.
K-Means on 50,000 random delivery coordinates separated five perfectly distinct geographic zones with essentially zero overlap between metros.
K-Means delivery zones
5 clusters identified with silhouette score 0.96 — zero overlap between metro regions
Zone 1 — Los Angeles
Zone 2 — Chicago
Zone 3 — Seattle
Zone 4 — Boston
Zone 5 — Austin
Cluster quality metrics
Optimal K
K = 5
Silhouette score
0.96
Cluster overlap
~0%
Sample size
50K
Insights
Key findings, strategic recommendations, and overall conclusions from the Hack Report.
Key insights
Boston is 59% less efficient
Drivers travel ~110 km more per route than LA while completing the same number of stops — driven by historic road layouts and water barriers.
No new EVs needed
Boston produces 23.2 kg more CO₂ per route than LA. Better routing alone — no new vehicles or infrastructure — can substantially cut emissions.
Clustering score of 0.96
K-Means perfectly identified all 5 metro zones with near-zero cluster overlap — validating automated territory design over manual boundaries.
Recommendations
1. Prioritize Boston
Highest avg route distance (298.8 km) and CO₂ (62.7 kg/route). Redesign territories, improve stop sequencing, optimize depot coverage.
2. Benchmark Los Angeles
Highest volume (411,552 stops) yet shortest routes (188.3 km). Use LA as the efficiency benchmark for other metros.
3. Expand clustering
Natural delivery zones emerge from stop coordinates (silhouette 0.96). Use clustering for territory design instead of static admin boundaries.
4. Improve Austin depots
Lowest volume (31,060 stops) with high route distance (249.6 km). Reposition fulfillment hubs to reduce travel as demand grows.
Overall findings
Key conclusions from the Hack Report analysis
Geographic clustering is highly effective — K=5 with silhouette 0.96 validates automated territory planning.
Route inefficiency varies substantially by city — Boston routes are ~59% longer than LA with nearly identical stop counts (~146.7).
CO₂ emissions track routing efficiency — Boston emits ~23.2 kg more CO₂ per route than LA (59% higher).
Operations are highly standardized — bell curve centered at 146.7 stops (σ ±31), so optimization scales predictably fleet-wide.
Delivery density improves efficiency — LA’s high volume still yields the shortest average route distance when properly optimized.
Interactive App
Explore K-Means delivery clusters on an interactive dark map — zoom, pan, and click stops for details.
Interactive geospatial app
K-Means delivery clusters on map
Explore delivery stops colored by cluster zone across five U.S. metros. Click any point for zone, city, and stop details. Built with Folium + Leaflet (aws_delivery_map.html).
Zone 1 — Los AngelesZone 2 — ChicagoZone 3 — BostonZone 4 — SeattleZone 5 — Austin
Export
Print the full analysis or export raw metrics as JSON / CSV.
Export report data
Download Hack Report
Print saves all sections, charts, and insights. JSON/CSV include city-level metrics only.
Use Print → Save as PDF for a full report. JSON/CSV: 5 cities + global stats (898,415 stops, 6,112 routes, K=5, silhouette 0.96).