11 KiB
AreoRAG: A Physics-Informed Framework for Multi-Source Retrieval Augmented Generation over Planetary Spatial Data
I. INTRODUCTION
Large Language Models (LLMs) have achieved remarkable success in handling a variety of natural language processing tasks, attributable to their robust capabilities in understanding and generating language and symbols [1]. In knowledge-intensive retrieval tasks, Retrieval Augmented Generation (RAG) has become a standardized solution paradigm [2]–[4]. Previous works [5]–[11] have made significant strides in addressing the inherent knowledge limitations of LLMs by introducing external knowledge bases, markedly improving the accuracy and fidelity of LLM responses. Notably, the synergy between LLMs and Knowledge Graphs (KGs) has been proposed to achieve more efficient and structured information retrieval [12]–[26], propelling the deep reasoning capabilities of RAG in multi-hop question answering, knowledge-intensive retrieval, and multi-source data fusion.
With the rapid advancement of deep space exploration programs, including NASA's Mars 2020 Perseverance mission, ESA's ExoMars, and CNSA's Tianwen-1 mission, the volume and heterogeneity of planetary observation data have grown at an unprecedented scale [27], [28]. These multi-source datasets — spanning orbital remote sensing imagery (e.g., HiRISE at 0.3m, CTX at 6m, CRISM spectral cubes), in-situ measurements (e.g., rover-mounted spectrometers, ground-penetrating radar), and derived products (e.g., digital terrain models, mineral abundance maps) — collectively constitute a rich yet highly complex knowledge ecosystem for planetary science [29]. The demand for intelligent retrieval over such multi-source planetary data has become increasingly urgent: researchers need to perform spatial semantic search (e.g., "find HiRISE images with dust devil tracks near the equator"), cross-source association (e.g., aggregating multi-resolution data for a target region), and temporally-aware retrieval (e.g., "images captured by Zhurong rover within the first 90 Sols after landing along its southward traverse"). These tasks require the RAG system to bridge the gap between natural language queries and the underlying spatiotemporal structure of planetary observations.
Recent multi-source RAG frameworks, exemplified by MultiRAG [30], have demonstrated promising results in mitigating hallucinations arising from data sparsity and inter-source inconsistency through multi-source line graph construction and multi-level confidence computation. However, these frameworks are fundamentally designed for discrete textual entities (e.g., flight records, book metadata, stock transactions) with explicit semantic associations, and their direct application to planetary spatial data introduces critical structural failures. Building upon the categorization of retrieval challenges in multi-source settings [9], [30], we identify the following failure modes that are unique to multi-source planetary spatial data retrieval:
-
Spatial proximity collapse: Existing graph-based RAG methods rely on discrete entity co-occurrence to establish edges. When applied to spatially continuous observation data, encoding spatial proximity (e.g., two overlapping image footprints) as binary edges leads to
O(k^2)edge explosion, fundamentally destroying the sparsity-oriented optimizations of line graph structures. -
Scale hierarchy distortion: Planetary observations inherently form a resolution hierarchy — a single CTX mosaic (6m) spatially contains dozens of HiRISE strips (0.3m), which in turn are nested within MOLA topographic grids (~460m). This containment relationship cannot be faithfully represented by flat, pairwise graph topologies.
-
Scientific conflict erasure: Multi-level confidence mechanisms designed to filter "unreliable" nodes inadvertently eliminate scientifically valuable observational disagreements. When an orbital spectrometer detects hydrated minerals on the surface while in-situ drilling reveals no such signature at depth, this conflict is not data error but evidence of subsurface geological stratification — a potential major scientific discovery.
Fig. 1 illustrates the fundamental differences between conventional text-based multi-source retrieval and planetary spatial data retrieval. The continuous spatial embedding, hierarchical resolution structure, and physics-grounded observational conflicts of planetary data are inherently incompatible with discrete graph topologies and de-falsification mechanisms designed for textual knowledge bases. Against this backdrop, we focus on addressing the retrieval challenges unique to multi-source planetary spatial data to empower knowledge-augmented generation for deep space exploration. This work primarily explores the following two fundamental challenges:
1) Failure of Discrete Representation for Continuous Spatiotemporal Topology. Multi-source knowledge aggregation methods, such as multi-source line graphs (MLG) [30], [31], rely heavily on discrete text entities and explicit semantic associations to construct graph topology. However, planetary science data is intrinsically embedded in continuous Euclidean physical space. Attempting to encode continuous spatial proximity and directional relationships within traditional discrete graph structures inevitably triggers edge explosion, thereby undermining the efficiency gains that graph-based methods achieve for sparse data distributions. Specifically, for k co-located spatial entities, pairwise spatial encoding requires \binom{k}{2} = O(k^2) edges, while the observation hierarchy (from coarse-resolution global coverage to fine-resolution local strips) demands nested containment relationships that flat graph topologies cannot express. This structural bottleneck prevents existing discrete logical graph structures from bridging the gap between physical continuity and semantic discreteness, constituting a fundamental constraint on planetary spatial reasoning capabilities.
2) Contradiction Between Scientific Cognitive Conflict and Traditional De-Falsification Mechanisms. The core assumption underlying existing multi-source RAG frameworks is that inter-source data inconsistency typically stems from erroneous information or model hallucination, and therefore relies on multi-level confidence computation to eliminate conflicting nodes [30], [33], [34]. However, in deep space exploration scenarios, where absolute ground truth is absent, different observation platforms (e.g., orbiters vs. rovers) often yield significantly conflicting observations of the same target region due to differences in observation scale, penetration depth, and instrument principles. For instance, an orbital spectrometer may detect surface hydrated minerals while in-situ drilling at the same location finds no mineralogical anomaly — such conflict is not data error but an inherent attribute of multi-dimensional scientific observation, potentially containing clues to major scientific discoveries such as geological evolution and subsurface water migration. If existing conflict-filtering mechanisms are applied indiscriminately, severe over-smoothing will result, uniformly erasing high-value scientific anomalies and fundamentally violating the knowledge discovery paradigm of "preserving disagreement, multi-source corroboration" that is central to deep space exploration.
To address these challenges, we propose AreoRAG, a novel physics-informed framework designed for multi-source retrieval augmented generation over planetary spatial data. First, we introduce the Hyperbolic Spatial Hypergraph (HySH) for unified spatiotemporal knowledge representation. By employing $n$-ary spatial observation hyperedges, HySH binds co-located multi-source observations into single hyperedges, reducing edge complexity from O(k^2) to O(k). Through scale-aware Lorentz embedding, the resolution hierarchy is naturally encoded via radial depth in hyperbolic space, where the exponential volume growth of negative-curvature geometry faithfully accommodates the exponentially increasing number of observations at finer scales. Second, we propose Physics-Informed Conflict Triage (PICT), which replaces the conventional conflict-filtering paradigm with a classify-then-differentiate strategy. PICT detects inter-source conflicts via cross-source interaction entropy, classifies each conflict into four physically-grounded categories (noise, instrument-inherent, scale-dependent, and temporal-evolution), and applies differentiated confidence recalibration — filtering only noise conflicts while preserving and annotating scientifically valuable disagreements with physical bridging explanations. We provide a formal anti-over-smoothing guarantee ensuring that nodes involved in explainable scientific conflicts can never be filtered out by the confidence mechanism.
The contributions of this paper are summarized as follows:
-
Hyperbolic Spatial Knowledge Aggregation: In the knowledge construction module, we introduce the Hyperbolic Spatial Hypergraph as a data structure for unified spatiotemporal representation of multi-source planetary observations. By coupling $n$-ary spatial observation hyperedges with scale-aware Lorentz embedding, this structure simultaneously resolves the edge explosion problem inherent in encoding continuous spatial proximity and faithfully represents the resolution hierarchy through the intrinsic geometry of hyperbolic space. We further introduce the Spatial Outward Einstein Midpoint for cross-resolution aggregation that provably preserves fine-scale observational details.
-
Physics-Informed Conflict Triage: In the retrieval module, we propose a conflict detection and classification mechanism grounded in observation physics. By formalizing conflicts through observation geometry parameters and measuring cross-source interaction entropy, we classify inter-source disagreements into four categories with orthogonal physical signatures. A conflict-aware confidence recalibration strategy is designed to filter noise while preserving scientifically explainable conflicts with provenance metadata and physical bridging explanations, accompanied by a formal anti-over-smoothing guarantee (Theorem 2).
-
Experimental Validation and Performance Comparison: We construct a multi-source planetary spatial retrieval benchmark encompassing orbital imagery, in-situ measurements, and derived products from Mars exploration missions. Extensive experiments demonstrate that AreoRAG significantly outperforms existing state-of-the-art multi-source RAG methods in both retrieval accuracy and scientific conflict preservation, while maintaining competitive efficiency through the compact hyperbolic representation.