ai初稿

2026-04-02 10:24:22 +08:00
parent 5e2eb7b8c0
commit dc3ae0680c
7 changed files with 857 additions and 14 deletions
--- a/paper_introduction.md
+++ b/paper_introduction.md
@@ -1,31 +1,47 @@
-# AreoRAG: A Physics-Informed Framework for Multi-Source Retrieval Augmented Generation over Planetary Spatial Data
+# AreoRAG: Hyperbolic Spatial Hypergraph and Physics-Informed Conflict Triage for Multi-Source Planetary Retrieval Augmented Generation
+
+Author Name ${}^{1}$ , Author Name ${}^{2\text{ \ding{42} }}$ , Author Name ${}^{1}$
+
+${}^{1}$ Affiliation One
+
+${}^{2}$ Affiliation Two
+
+Email: {author1, author2}@example.edu
+
+**Abstract** — Retrieval Augmented Generation (RAG) has demonstrated considerable promise in grounding Large Language Models (LLMs) with external knowledge for knowledge-intensive question answering. However, extending RAG to the domain of planetary science — where multi-source remote sensing observations are inherently embedded in continuous physical space and inter-source disagreements often carry scientific value — introduces fundamental challenges that existing multi-source RAG frameworks cannot address. These challenges manifest in two critical aspects: (1) existing discrete graph topologies (e.g., multi-source line graphs) suffer from edge explosion when encoding continuous spatial proximity, failing to bridge the gap between physical continuity and semantic discreteness; and (2) conventional conflict-filtering mechanisms, designed under the assumption that inter-source inconsistency implies unreliability, systematically suppress scientifically valuable observational disagreements that are intrinsic to multi-platform deep-space exploration. To address these challenges, we propose AreoRAG, a novel framework tailored for multi-source planetary spatial data retrieval augmented generation. Our framework introduces two key innovations: (1) a Hyperbolic Spatial Hypergraph (HySH) construction module that employs $n$-ary spatial observation hyperedges embedded in hyperbolic space via the Lorentz model, where spatial resolution is coupled with radial depth to faithfully represent the hierarchical scale structure of planetary observations while reducing edge complexity from $O(k^2)$ to $O(k)$; and (2) a Physics-Informed Conflict Triage (PICT) module that detects inter-source conflicts via cross-source interaction entropy, classifies them into four physically grounded categories (noise, instrument-inherent, scale-dependent, and temporal-evolution), and applies differentiated confidence recalibration to preserve scientifically valuable disagreements while filtering genuine noise. Extensive experiments on multi-source planetary observation datasets demonstrate that AreoRAG significantly enhances both the retrieval fidelity and the scientific faithfulness of knowledge-augmented generation in planetary science scenarios.
+
+**Index Terms** — Retrieval Augmented Generation, Planetary Remote Sensing, Hyperbolic Hypergraph, Knowledge Conflict Triage, Multi-source Spatial Data, Mars Exploration

 ## I. INTRODUCTION

-Large Language Models (LLMs) have achieved remarkable success in handling a variety of natural language processing tasks, attributable to their robust capabilities in understanding and generating language and symbols [1]. In knowledge-intensive retrieval tasks, Retrieval Augmented Generation (RAG) has become a standardized solution paradigm [2]–[4]. Previous works [5]–[11] have made significant strides in addressing the inherent knowledge limitations of LLMs by introducing external knowledge bases, markedly improving the accuracy and fidelity of LLM responses. Notably, the synergy between LLMs and Knowledge Graphs (KGs) has been proposed to achieve more efficient and structured information retrieval [12]–[26], propelling the deep reasoning capabilities of RAG in multi-hop question answering, knowledge-intensive retrieval, and multi-source data fusion.
+The past two decades have witnessed an unprecedented accumulation of multi-source remote sensing data from Mars exploration missions. Orbital platforms such as Mars Reconnaissance Orbiter (MRO), Mars Express, and Tianwen-1 continuously acquire observations spanning diverse modalities — from sub-meter optical imagery (HiRISE at 0.3 m/pixel) and medium-resolution contextual mosaics (CTX at 6 m/pixel) to hyperspectral mineralogical mapping (CRISM at 18 m/pixel) and global topographic models (MOLA at ~460 m/pixel). Simultaneously, surface assets including the Curiosity and Zhurong rovers generate complementary in-situ measurements through spectrometers, ground-penetrating radar, and navigation cameras. This rapidly expanding, multi-source, multi-resolution data ecosystem has created a pressing demand for intelligent knowledge retrieval systems that can support planetary scientists in conducting semantic search, cross-source correlation, and multi-scale reasoning over heterogeneous observation archives [1]-[4].

-With the rapid advancement of deep space exploration programs, including NASA's Mars 2020 Perseverance mission, ESA's ExoMars, and CNSA's Tianwen-1 mission, the volume and heterogeneity of planetary observation data have grown at an unprecedented scale [27], [28]. These multi-source datasets — spanning orbital remote sensing imagery (e.g., HiRISE at 0.3m, CTX at 6m, CRISM spectral cubes), in-situ measurements (e.g., rover-mounted spectrometers, ground-penetrating radar), and derived products (e.g., digital terrain models, mineral abundance maps) — collectively constitute a rich yet highly complex knowledge ecosystem for planetary science [29]. The demand for intelligent retrieval over such multi-source planetary data has become increasingly urgent: researchers need to perform spatial semantic search (e.g., "find HiRISE images with dust devil tracks near the equator"), cross-source association (e.g., aggregating multi-resolution data for a target region), and temporally-aware retrieval (e.g., "images captured by Zhurong rover within the first 90 Sols after landing along its southward traverse"). These tasks require the RAG system to bridge the gap between natural language queries and the underlying spatiotemporal structure of planetary observations.
+Large Language Models (LLMs) have emerged as powerful tools for natural language understanding and generation [5], and Retrieval Augmented Generation (RAG) has been established as a standard paradigm for grounding LLM responses in external knowledge bases [6]-[8]. By dynamically retrieving relevant documents and conditioning generation on retrieved context, RAG effectively mitigates the hallucination problem inherent in LLMs and enables knowledge-intensive question answering. The synergy between LLMs and Knowledge Graphs (KGs) has further advanced retrieval performance through structured knowledge representation, achieving notable improvements in multi-hop reasoning, credibility assessment, and interpretability [9]-[13].

-Recent multi-source RAG frameworks, exemplified by MultiRAG [30], have demonstrated promising results in mitigating hallucinations arising from data sparsity and inter-source inconsistency through multi-source line graph construction and multi-level confidence computation. However, these frameworks are fundamentally designed for discrete textual entities (e.g., flight records, book metadata, stock transactions) with explicit semantic associations, and their direct application to planetary spatial data introduces critical structural failures. Building upon the categorization of retrieval challenges in multi-source settings [9], [30], we identify the following failure modes that are unique to multi-source planetary spatial data retrieval:
+Nevertheless, deploying RAG systems for planetary science knowledge retrieval introduces domain-specific complexities that fundamentally challenge existing frameworks. Unlike conventional multi-source retrieval scenarios (e.g., integrating flight records, financial reports, or web documents), planetary observation data possesses two distinctive characteristics. First, all data sources are spatially grounded: each observation is anchored to a specific spatial footprint on the Martian surface, a temporal acquisition window parameterized by Solar Longitude ($L_s$), and instrument-specific parameters such as spectral bands and spatial resolution. The relevance between two observations is therefore governed not merely by textual semantic similarity, but primarily by physical spatial proximity, temporal co-occurrence, and cross-resolution complementarity. Second, inter-source inconsistencies in planetary science are not exclusively indicative of data errors or model hallucinations; rather, they frequently arise as inherent consequences of multi-platform, multi-scale observation and may encode critical scientific discoveries — such as subsurface geological evolution revealed by discrepancies between orbital spectroscopy and in-situ drilling results.

-1) **Spatial proximity collapse**: Existing graph-based RAG methods rely on discrete entity co-occurrence to establish edges. When applied to spatially continuous observation data, encoding spatial proximity (e.g., two overlapping image footprints) as binary edges leads to $O(k^2)$ edge explosion, fundamentally destroying the sparsity-oriented optimizations of line graph structures.
+Recent advances in multi-source RAG, exemplified by MultiRAG [14], have made significant progress in addressing data sparsity and inter-source inconsistency through multi-source line graphs and multi-level confidence computation. However, when confronted with planetary spatial data, these methods encounter two structural bottlenecks that cannot be resolved through parameter tuning alone.

-2) **Scale hierarchy distortion**: Planetary observations inherently form a resolution hierarchy — a single CTX mosaic (6m) spatially contains dozens of HiRISE strips (0.3m), which in turn are nested within MOLA topographic grids (~460m). This containment relationship cannot be faithfully represented by flat, pairwise graph topologies.
+Building upon the analysis of existing multi-source RAG limitations [14]-[16] in the context of planetary science, we identify the following failure modes that are unique to spatially grounded, physically observed multi-source data:

-3) **Scientific conflict erasure**: Multi-level confidence mechanisms designed to filter "unreliable" nodes inadvertently eliminate scientifically valuable observational disagreements. When an orbital spectrometer detects hydrated minerals on the surface while in-situ drilling reveals no such signature at depth, this conflict is not data error but evidence of subsurface geological stratification — a potential major scientific discovery.
+1) **Spatial topology distortion**: When multi-source observations share no common textual entities but are spatially co-located, discrete line graphs fail to establish connectivity, resulting in fragmented retrieval.

-Fig. 1 illustrates the fundamental differences between conventional text-based multi-source retrieval and planetary spatial data retrieval. The continuous spatial embedding, hierarchical resolution structure, and physics-grounded observational conflicts of planetary data are inherently incompatible with discrete graph topologies and de-falsification mechanisms designed for textual knowledge bases. Against this backdrop, we focus on addressing the retrieval challenges unique to multi-source planetary spatial data to empower knowledge-augmented generation for deep space exploration. This work primarily explores the following two fundamental challenges:
+2) **Scale hierarchy collapse**: Observations at different spatial resolutions (e.g., 0.3 m vs. 460 m) exhibit a natural hierarchical containment structure that flat graph topologies cannot represent, leading to loss of cross-resolution context during aggregation.

-**1) Failure of Discrete Representation for Continuous Spatiotemporal Topology.** Multi-source knowledge aggregation methods, such as multi-source line graphs (MLG) [30], [31], rely heavily on discrete text entities and explicit semantic associations to construct graph topology. However, planetary science data is intrinsically embedded in continuous Euclidean physical space. Attempting to encode continuous spatial proximity and directional relationships within traditional discrete graph structures inevitably triggers edge explosion, thereby undermining the efficiency gains that graph-based methods achieve for sparse data distributions. Specifically, for $k$ co-located spatial entities, pairwise spatial encoding requires $\binom{k}{2} = O(k^2)$ edges, while the observation hierarchy (from coarse-resolution global coverage to fine-resolution local strips) demands nested containment relationships that flat graph topologies cannot express. This structural bottleneck prevents existing discrete logical graph structures from bridging the gap between physical continuity and semantic discreteness, constituting a fundamental constraint on planetary spatial reasoning capabilities.
+3) **Scientifically valuable conflict suppression**: Confidence-based conflict filtering indiscriminately eliminates disagreeing nodes, destroying observational evidence that may indicate genuine geological phenomena such as subsurface mineral heterogeneity.

-**2) Contradiction Between Scientific Cognitive Conflict and Traditional De-Falsification Mechanisms.** The core assumption underlying existing multi-source RAG frameworks is that inter-source data inconsistency typically stems from erroneous information or model hallucination, and therefore relies on multi-level confidence computation to eliminate conflicting nodes [30], [33], [34]. However, in deep space exploration scenarios, where absolute ground truth is absent, different observation platforms (e.g., orbiters vs. rovers) often yield significantly conflicting observations of the same target region due to differences in observation scale, penetration depth, and instrument principles. For instance, an orbital spectrometer may detect surface hydrated minerals while in-situ drilling at the same location finds no mineralogical anomaly — such conflict is not data error but an inherent attribute of multi-dimensional scientific observation, potentially containing clues to major scientific discoveries such as geological evolution and subsurface water migration. If existing conflict-filtering mechanisms are applied indiscriminately, severe over-smoothing will result, uniformly erasing high-value scientific anomalies and fundamentally violating the knowledge discovery paradigm of "preserving disagreement, multi-source corroboration" that is central to deep space exploration.
+These failure modes trace back to two fundamental scientific problems:

-To address these challenges, we propose AreoRAG, a novel physics-informed framework designed for multi-source retrieval augmented generation over planetary spatial data. First, we introduce the Hyperbolic Spatial Hypergraph (HySH) for unified spatiotemporal knowledge representation. By employing $n$-ary spatial observation hyperedges, HySH binds co-located multi-source observations into single hyperedges, reducing edge complexity from $O(k^2)$ to $O(k)$. Through scale-aware Lorentz embedding, the resolution hierarchy is naturally encoded via radial depth in hyperbolic space, where the exponential volume growth of negative-curvature geometry faithfully accommodates the exponentially increasing number of observations at finer scales. Second, we propose Physics-Informed Conflict Triage (PICT), which replaces the conventional conflict-filtering paradigm with a classify-then-differentiate strategy. PICT detects inter-source conflicts via cross-source interaction entropy, classifies each conflict into four physically-grounded categories (noise, instrument-inherent, scale-dependent, and temporal-evolution), and applies differentiated confidence recalibration — filtering only noise conflicts while preserving and annotating scientifically valuable disagreements with physical bridging explanations. We provide a formal anti-over-smoothing guarantee ensuring that nodes involved in explainable scientific conflicts can never be filtered out by the confidence mechanism.
+**Problem 1: Discrete Representation Failure for Continuous Spatiotemporal Topology.** Existing multi-source knowledge aggregation methods, such as multi-source line graphs [14], rely on discrete text entities and explicit semantic associations to construct graph topology. However, planetary science data is intrinsically embedded in continuous Euclidean physical space. Attempting to encode continuous spatial proximity and directional relationships within traditional discrete graph structures inevitably triggers an edge explosion problem — $k$ co-located spatial entities require $\binom{k}{2} = O(k^2)$ pairwise spatial proximity edges — thereby destroying the optimizations that existing graph models achieve for data sparsity. The discrete logical graph structure thus constitutes a structural bottleneck constraining planetary spatial reasoning capabilities, unable to bridge the chasm between physical continuity and semantic discreteness.
+
+**Problem 2: Fundamental Conflict Between Scientific Cognitive Divergence and Traditional De-Falsification Mechanisms.** The core assumption underlying existing multi-source RAG frameworks is that inter-source data inconsistency typically originates from misinformation or model hallucinations, and therefore relies on multi-level confidence computation to eliminate conflicting nodes [14], [17]. However, in deep-space exploration scenarios, the absence of absolute ground truth means that different observation platforms (e.g., orbiters versus rovers), due to differences in observation scale, penetration depth, and instrumental principles, often produce significantly conflicting results for the same target region. For instance, orbital spectrometers may detect surface hydrated minerals while in-situ drilling reveals no anomaly — a conflict arising not from data error, but from the inherent multi-dimensional nature of scientific observation, potentially harboring clues to major discoveries such as geological evolution. Applying existing conflict-filtering mechanisms indiscriminately would cause severe over-smoothing, uniformly suppressing high-value scientific anomalies and fundamentally violating the epistemological principle of deep-space exploration: preserving controversy and enabling multi-source corroboration for knowledge discovery.
+
+To address these two fundamental challenges, we propose AreoRAG, a novel framework specifically designed for multi-source planetary spatial data retrieval augmented generation. AreoRAG introduces two synergistic innovations. First, to resolve Problem 1, we construct a Hyperbolic Spatial Hypergraph (HySH) that employs $n$-ary spatial observation hyperedges to bind co-located multi-source observations into single high-order facts, reducing edge complexity from $O(k^2)$ to $O(k)$. These hyperedges are embedded in hyperbolic space via the Lorentz model, where the exponential volume growth of negative-curvature geometry naturally accommodates the hierarchical scale structure of planetary observations — coarse-resolution global data resides near the origin while fine-resolution local data extends toward the boundary. Second, to resolve Problem 2, we develop a Physics-Informed Conflict Triage (PICT) mechanism that replaces the uniform conflict-filtering paradigm with a differentiated triage approach. PICT detects inter-source conflicts through cross-source interaction entropy, classifies each conflict into one of four physically grounded categories (noise, instrument-inherent, scale-dependent, temporal-evolution), and applies category-specific confidence recalibration — filtering genuine noise while provably preserving and even boosting the confidence of scientifically valuable observational disagreements. Together, HySH provides spatially faithful multi-source evidence to PICT, while PICT feeds back triage results to prioritize scientifically interesting regions in subsequent retrieval, forming a tightly coupled framework.

 The contributions of this paper are summarized as follows:

-1) **Hyperbolic Spatial Knowledge Aggregation**: In the knowledge construction module, we introduce the Hyperbolic Spatial Hypergraph as a data structure for unified spatiotemporal representation of multi-source planetary observations. By coupling $n$-ary spatial observation hyperedges with scale-aware Lorentz embedding, this structure simultaneously resolves the edge explosion problem inherent in encoding continuous spatial proximity and faithfully represents the resolution hierarchy through the intrinsic geometry of hyperbolic space. We further introduce the Spatial Outward Einstein Midpoint for cross-resolution aggregation that provably preserves fine-scale observational details.
+1) **Hyperbolic Spatial Hypergraph Construction**: We introduce HySH, a knowledge construction module that employs $n$-ary spatial observation hyperedges embedded in hyperbolic space to achieve unified spatiotemporal representation of multi-source planetary data. By coupling spatial resolution with hyperbolic radial depth via the Lorentz model, HySH faithfully preserves the hierarchical scale structure of planetary observations while eliminating edge explosion through high-order relational encoding. A resolution-aware Spatial Outward Einstein Midpoint (Spatial OEM) aggregation operator is further proposed to prevent hierarchical collapse during cross-resolution evidence fusion, with a formal guarantee of outward bias.

-2) **Physics-Informed Conflict Triage**: In the retrieval module, we propose a conflict detection and classification mechanism grounded in observation physics. By formalizing conflicts through observation geometry parameters and measuring cross-source interaction entropy, we classify inter-source disagreements into four categories with orthogonal physical signatures. A conflict-aware confidence recalibration strategy is designed to filter noise while preserving scientifically explainable conflicts with provenance metadata and physical bridging explanations, accompanied by a formal anti-over-smoothing guarantee (Theorem 2).
+2) **Physics-Informed Conflict Triage**: We propose PICT, a retrieval module that fundamentally redefines the role of inter-source conflict in RAG systems. Through cross-source interaction entropy for conflict detection, a physically grounded four-category conflict classification informed by observation geometry, and differentiated confidence recalibration, PICT provably prevents the over-smoothing of scientifically valuable disagreements (Anti-Over-Smoothing Guarantee) while maintaining noise-filtering capability. To the best of our knowledge, this is the first conflict-handling mechanism in RAG that explicitly distinguishes between erroneous inconsistency and scientifically meaningful observational divergence.

-3) **Experimental Validation and Performance Comparison**: We construct a multi-source planetary spatial retrieval benchmark encompassing orbital imagery, in-situ measurements, and derived products from Mars exploration missions. Extensive experiments demonstrate that AreoRAG significantly outperforms existing state-of-the-art multi-source RAG methods in both retrieval accuracy and scientific conflict preservation, while maintaining competitive efficiency through the compact hyperbolic representation.
+3) **Integrated Framework and Experimental Validation**: We design the AreoRAG Prompting (ARP) algorithm that integrates HySH and PICT through three explicit coupling points: spatial alignment as a prerequisite for interaction entropy computation, radial depth difference as a resolution disparity signal for conflict classification, and triage-driven retrieval priority feedback. Extensive experiments on multi-source planetary observation datasets demonstrate that AreoRAG significantly outperforms existing multi-source RAG methods in both retrieval fidelity and scientific faithfulness, with particular advantages in scenarios involving cross-resolution reasoning and observation-grounded conflict preservation.