6.0 KiB
6.0 KiB
CLAUDE.md
This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
Project Overview
This is a LaTeX research paper titled "An I/O-Efficient Approach for Concurrent Spatio-Temporal Range Retrievals over Large-Scale Remote Sensing Image Data" submitted to an IEEE journal. The paper proposes novel techniques for efficient retrieval of remote sensing imagery, including:
- Index-as-an-Execution-Plan paradigm: Integrates fine-grained partial retrieval directly into indexing structure
- Dual-layer inverted index (G2I/I2G): Pre-materializes grid-to-pixel mappings to eliminate runtime geometric calculations
- Hybrid concurrency-aware I/O coordination: Combines Calvin-style deterministic ordering with optimistic execution
- SA-GMAB (Surrogate-Assisted Genetic Multi-Armed Bandit): Auto-tuning mechanism for fluctuating workloads
Build and Compilation
Primary Build Commands
# Standard compilation (recommended for IEEE format)
pdflatex rs_retrieval.tex
# Alternative compilation (being tested)
xelatex rs_retrieval.tex
# Full build cycle (includes bibliography)
pdflatex rs_retrieval.tex
bibtex rs_retrieval
pdflatex rs_retrieval.tex
pdflatex rs_retrieval.tex
Build System
- Compiler: pdfTeX (MiKTeX distribution on Windows)
- Document Class: IEEEtran (IEEE journal format)
- Output: 15-page PDF (~2.36MB)
- No automation: No Makefile or build scripts - use manual compilation
Document Structure
The paper follows standard IEEE journal organization with these main sections:
- Introduction - Motivation and problem statement
- Related Work - Spatio-temporal retrieval, concurrency control, I/O tuning
- Problem Formulation - Mathematical definitions and cost models
- I/O-aware Indexing Structure (Section 3) - Core technical contribution
- Grid-to-Image (G2I) index
- Image-to-Grid (I2G) index
- Pre-materialized execution plans
- Hybrid Concurrency-Aware I/O Coordination (Section 4)
- Deterministic vs optimistic execution modes
- Adaptive mode switching
- I/O Stack Tuning (Section 5) - SA-GMAB algorithm
- Performance Evaluation (Section 6) - Experimental results on Martian datasets
- Conclusions - Summary of contributions
Key Files
rs_retrieval.tex- Main LaTeX source (single-file document)references.bib- Bibliography databasefig/- Figures directory (index.png, st-query.png, cc.png)exp/- Experimental results (PDF charts)
LaTeX Package Dependencies
Required Packages
\documentclass[lettersize,journal]{IEEEtran}
\usepackage{amsmath,amsfonts} % Mathematics
\usepackage{graphicx} % Figures
\usepackage[linesnumbered,lined,ruled]{algorithm2e} % Algorithms
\usepackage{cite} % Citations
\usepackage{array} % Table formatting
\usepackage{makecell} % Table cells
\usepackage{subfigure} % Subfigures
Chinese Language Support
- The project directory name includes Chinese characters (遥感影像部分检索)
- Document content is in English
- Uses ctex distribution (Chinese TeX) on the system
Document Conventions
Cross-References
All sections use \label{} and \ref{} for cross-referencing:
- Section labels:
sec:XXformat (e.g.,\label{sec:Index}) - Algorithm labels:
alg:XXformat - Figure labels:
fig:XXformat - Equation labels:
eq:XXformat
Mathematical Notation
- Extensive use of mathematical formulations
- Cost models use notation:
C_{total},T_{compute}, etc. - Algorithm pseudo-code uses algorithm2e package
Citation Style
- IEEE citation style with numeric references
- Citations in format:
\cite{AuthorYearKEY} - Bibliography managed in
references.bib
Figure Organization
Figures are organized by topic:
fig/index.png- Index schema designfig/st-query.png- Retrieval-time execution flowfig/cc.png- Concurrency coordination mechanism
Common Editing Tasks
Adding a New Section
- Add
\section{Section Name}with\label{sec:NAME} - Update the table of contents/organization paragraph in Introduction
- Ensure cross-references use correct label format
Modifying Algorithms
- Use
algorithm2eenvironment - Keep
linesnumbered,lined,ruledoptions for consistency - Label with
\label{alg:NAME}for referencing
Adding Figures
- Place figure files in
fig/directory - Use
\begin{figure}[t]for top placement (IEEE convention) - Include
\caption{}and\label{fig:NAME} - Refer using
\ref{fig:NAME}
Bibliography Updates
- Add entries to
references.bib - Use BibTeX key format:
AuthorYearKEY(e.g.,Ma15RS_bigdata) - Run
bibtex rs_retrievalafter modifying .bib file - Compile LaTeX twice to resolve references
Important Notes
Compilation Workflow
When making changes that affect:
- Text only: Single
pdflatexrun sufficient - Citations: Run
pdflatex→bibtex→pdflatex× 2 - New sections/labels: Run
pdflatextwice to resolve cross-references - Figures: Ensure all figure files exist before compilation
Git Repository
- Main branch:
main - Recent activity: Testing XeLaTeX compilation
- Modified files tracked: .tex, .pdf, .aux, .log, .synctex.gz
Document Formatting
- Strict IEEE journal format compliance
- Font: Times Roman family
- Two-column layout
- Letter size paper
- 15-page final document
Known Issues
- Some font variants (bold/italic) unavailable in current TeX distribution
- Testing migration from pdflatex to xelatex (commit
f7ffed8)
Experimental Data Reference
The paper evaluates on Martian remote sensing datasets:
- Total volume: 51.9 TB across 669,641 images
- Datasets: MoRIC, CTX, THEMIS, HiRISE
- Environment: 9-node cluster with HBase and Lustre file system
- Metrics: Latency, I/O throughput, request collapse efficiency
Results show:
- Order-of-magnitude latency reduction with I/O-aware indexing
- 54x speedup under high contention with hybrid coordination
- 2x faster recovery from workload shifts with SA-GMAB