Evaluating constellation feasibility using programmatic analysis of massive geospatial datasets.
The scope of Nova Capital's investment has expanded. OrbitWeb isn't just launching rockets; they are building a global high-speed internet constellation.
To evaluate the technical feasibility of this constellation, Managing Director Sara needs you to analyze their proposed orbital shells. You request the data from the Space Situational Awareness (SSA) database, identifying all 25,000+ objects currently in Low Earth Orbit (LEO), along with their telemetry vectors.
You try to open the 1.2 GB file in Excel to calculate spatial overlaps and collision probabilities. Excel freezes and crashes. Returning to your desk, Sara looks at the frozen screen:
"Spreadsheets won't cut it anymore. We are dealing with big data now. OrbitWeb claims their 550km orbits are clear, but if Kuiper is targeting the same shell, the risk of a Kessler Syndrome event increases exponentially. Write a script to find the congestion hotspots by end of day."
Why do satellites cluster? It’s not random. Communication satellites need specific "shells" (altitudes) for low latency, while Earth Observation satellites need specific inclinations for sun-synchronicity. Analyzing these shells requires us to treat space not as a void, but as a congested highway system.
When data exceeds 1 million cells or requires complex string parsing, we switch to programming.
pandas, which is essentially Excel but for
Python. It can read our data directly from the internet. Click the "Play" button next to the cell to
run it.junk that only contains rows where the object type is 'DEBRIS'.
If you must stay in Sheets, use the Query Language. It is significantly faster than normal formulas for big datasets because it processes the data on Google's servers, not in your browser.
=QUERY(. This allows you to write SQL-like code inside a
single cell to summarize thousands of rows instantly.=QUERY('Sheet1'!A:Z, "SELECT B, C WHERE C > 500 ORDER BY C DESC")
Orbits are continuous, but analysis is discrete. We need to create "bins" (e.g., 200km-400km, 400km-600km)
using pd.cut() to see where the traffic jams are occuring across LEO.
Data science isn't about doing the task once; it's about writing a recipe so the task does itself. Your final Python script or Sheet should be built so that if Nova Capital gives you a new data dump tomorrow, you just press "Play" and get the exact same analysis without clicking any buttons.
IF() formula in Sheets—or a simple if statement in Python—that flags any
shell where the number of objects is dangerously high.