DLRA Threat Lens: RAG-Based Threat Assessment for Defense Intelligence
DLRA Threat Lens is a retrieval-augmented generation platform that applies domain-tuned embeddings to achieve 94.2% top-5 retrieval accuracy on defense-domain documents. Intelligence analysts query large threat report collections in natural language and receive evidence-grounded answers with sentence-level source attribution.
Intelligence analysts operating in multi-source environments face a volume problem that manual processes cannot scale to meet. According to Deloitte's 2024 report The Future of Intelligence Analysis, IC analysts spend more than 61% of their time on non-advisory prep work — triage, summarization, and source verification — and could reclaim roughly 364 hours per analyst per year with AI-enabled support. The National Geospatial-Intelligence Agency noted that intelligence organizations could soon require more than 8 million imagery analysts if current trends hold — more than five times the total number of people with top secret clearances in all of government.
Threat Lens addresses this bottleneck at the retrieval layer, where accuracy on domain-specific vocabulary determines whether the system surfaces the correct evidence or buries it below irrelevant material.
Technical Architecture
Threat Lens operates a four-stage pipeline — ingestion, retrieval, generation, and provenance — optimized for defense intelligence document types including structured threat reports, unstructured cables, OSINT feeds, and multi-source intelligence products.
Ingestion and Chunking
Documents are processed through a schema-aware parser that identifies logical sections (executive summary, indicators, source attribution, assessment, classification markings) and splits along section boundaries rather than fixed-token windows. Each chunk carries metadata including report ID, section type, sentence offsets, and classification level.
This approach was adopted after internal testing demonstrated that fixed-token chunking (512 tokens) routinely cut structured intelligence reports mid-paragraph, severing the connection between an indicator and its source attribution. Schema-aware chunking preserves this linkage, making every chunk independently citable.
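As an illustration of the schema-aware approach, a minimal chunker might split on known section headers and record per-sentence character offsets so each chunk stays independently citable. The header names, `Chunk` fields, and regexes below are hypothetical stand-ins, not Threat Lens internals:

```python
import re
from dataclasses import dataclass

@dataclass
class Chunk:
    report_id: str
    section_type: str
    text: str
    sentence_offsets: list  # (start, end) character offsets per sentence

# Hypothetical section headers; a production parser would also handle
# classification markings and report-type-specific layouts.
SECTION_HEADERS = re.compile(
    r"^(EXECUTIVE SUMMARY|INDICATORS|SOURCE ATTRIBUTION|ASSESSMENT)\s*$",
    re.MULTILINE,
)

def chunk_report(report_id: str, text: str) -> list:
    """Split along section boundaries rather than fixed-token windows."""
    chunks = []
    headers = list(SECTION_HEADERS.finditer(text))
    for i, m in enumerate(headers):
        start = m.end()
        end = headers[i + 1].start() if i + 1 < len(headers) else len(text)
        body = text[start:end].strip()
        # Naive sentence segmentation, kept only for offset bookkeeping.
        offsets, pos = [], 0
        for sent in re.split(r"(?<=[.!?])\s+", body):
            if sent:
                s = body.find(sent, pos)
                offsets.append((s, s + len(sent)))
                pos = s + len(sent)
        chunks.append(Chunk(report_id, m.group(1).title(), body, offsets))
    return chunks
```

Because the split happens at section boundaries, an indicator and its source attribution are never severed into separate chunks.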
Domain-Tuned Retrieval
Threat Lens uses embedding models fine-tuned on defense intelligence corpora. The Voyage AI 2024 domain-adaptation study found that domain-specific embedding fine-tuning improves retrieval accuracy by 6 to 7 percentage points on average compared to general-purpose embeddings. A joint Cisco and NVIDIA 2024 enterprise fine-tuning study reported similar improvements in regulated industries where vocabulary specialization matters.
On DLRA's internal evaluation set — drawn from real analyst workflows across threat report classification, entity extraction, and multi-source correlation — the domain-tuned model achieves 94.2% top-5 retrieval accuracy, compared with 87.3% for general-purpose embeddings on the same evaluation set. Karpukhin et al. (2020), in Dense Passage Retrieval for Open-Domain Question Answering, established that retrieval quality is primarily an encoder problem, and domain fine-tuning directly addresses the encoder's representation of specialized vocabulary.
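The top-5 metric itself is straightforward to reproduce. A sketch using cosine similarity over normalized embeddings (toy inputs, not the production model or evaluation set):

```python
import numpy as np

def top_k_accuracy(query_embs, doc_embs, relevant_idx, k=5):
    """Fraction of queries whose relevant document ranks in the
    top-k results by cosine similarity."""
    q = query_embs / np.linalg.norm(query_embs, axis=1, keepdims=True)
    d = doc_embs / np.linalg.norm(doc_embs, axis=1, keepdims=True)
    sims = q @ d.T                                   # (n_queries, n_docs)
    topk = np.argsort(-sims, axis=1, kind="stable")[:, :k]
    hits = [rel in row for rel, row in zip(relevant_idx, topk)]
    return sum(hits) / len(hits)
```

Running the domain-tuned and general-purpose encoders through the same harness on the same query/document pairs is what makes the 94.2% vs. 87.3% comparison apples-to-apples.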
Augmented Generation with Citation Constraints
The generation layer receives the analyst's query alongside the top retrieved chunks and produces a response that cites specific chunks for each claim. The prompt architecture enforces citation constraints: every factual claim in the generated output must reference a specific retrieved passage, and claims without supporting evidence are flagged rather than generated.
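One common way to enforce such constraints is a prompt that demands inline citations plus a post-hoc check that flags uncited sentences for review. The template and the `[C<n>]` marker convention below are illustrative assumptions, not the actual Threat Lens prompt:

```python
import re

PROMPT_TEMPLATE = """Answer the analyst's question using ONLY the passages below.
Cite the supporting passage for every factual claim as [C<n>].
If no passage supports a claim, do not state it.

Passages:
{passages}

Question: {question}
"""

CITATION = re.compile(r"\[C(\d+)\]")

def check_citations(answer: str, n_chunks: int) -> list:
    """Return sentences lacking a valid citation, to be flagged
    rather than passed through as generated claims."""
    flagged = []
    for sent in re.split(r"(?<=[.!?])\s+", answer.strip()):
        ids = [int(i) for i in CITATION.findall(sent)]
        if not ids or any(i >= n_chunks for i in ids):
            flagged.append(sent)
    return flagged
```

A sentence citing a chunk index that was never retrieved is treated the same as an uncited sentence: both are surfaced to the analyst instead of being silently accepted.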
Sentence-Level Provenance
Every generated sentence links back to the specific chunk and sentence offsets that support it. Analysts can click through from any claim to its source passage, verify the original context, and accept, reject, or rewrite at the sentence level.
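A minimal provenance record needs only a chunk identifier and character offsets; the click-through resolution is then a dictionary lookup plus a slice. The field names and ID scheme here are assumptions for illustration:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Provenance:
    chunk_id: str   # e.g. "<report_id>:<section_type>"
    start: int      # character offsets within the source chunk text
    end: int

def resolve(prov: Provenance, chunk_store: dict) -> str:
    """Return the exact source span a generated sentence is grounded in,
    so the analyst can verify it in its original context."""
    return chunk_store[prov.chunk_id][prov.start:prov.end]
```

Because the chunker recorded sentence offsets at ingestion time, accept/reject/rewrite decisions can be made against the precise source sentence rather than a whole passage.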
Performance Specifications
| Specification | Value | Context |
|---|---|---|
| Document processing throughput | 10,000 documents per hour | Batch ingestion of structured threat reports |
| Top-5 retrieval accuracy (domain-tuned) | 94.2% | Evaluated on defense intelligence benchmark set |
| Top-5 retrieval accuracy (general-purpose baseline) | 87.3% | Same evaluation set, general-purpose embeddings |
| Retrieval accuracy improvement | +6.9 percentage points | Consistent with Voyage AI (2024) and Cisco/NVIDIA (2024) findings |
| Documents processed to date | 2.4 million+ | Across three operational evaluation cycles |
| Provenance granularity | Sentence-level | Each claim linked to source chunk and offsets |
| Supported document types | Structured reports, cables, OSINT feeds, imagery notes | Schema-aware parsers per document type |
Operational Use Cases
Threat Lens supports three primary workflows: multi-source threat report triage, indicator extraction and correlation, and evidence-grounded threat assessment drafting.
Multi-Source Triage
Analysts querying across hundreds or thousands of recent threat reports receive ranked results with relevance scores and source attribution. The system reduces the time spent scanning reports for relevant indicators from hours to minutes.
Indicator Extraction and Correlation
Threat Lens identifies named entities, threat indicators, and tactical patterns across document collections and surfaces correlations that manual review would miss. Entity extraction is optimized for defense-specific entity types: threat actors, weapons systems, geographic designators, unit identifiers, and doctrine references.
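A pattern-based sketch conveys the idea of defense-specific entity types, though the patterns below are illustrative stand-ins for what would in practice be a trained NER model:

```python
import re

# Illustrative patterns only; real extraction would use a model trained
# on defense-domain annotations, not hand-written regexes.
ENTITY_PATTERNS = {
    "unit_identifier": re.compile(
        r"\b\d+(?:st|nd|rd|th) [A-Z][a-z]+ (?:Brigade|Division|Regiment)\b"
    ),
    "geo_designator": re.compile(r"\bMGRS [0-9A-Z]{5,}\b"),
}

def extract_entities(text: str) -> dict:
    """Map each entity type to the mentions found in the text."""
    found = {}
    for etype, pattern in ENTITY_PATTERNS.items():
        hits = pattern.findall(text)
        if hits:
            found[etype] = hits
    return found
```

Extracted mentions can then be correlated across the collection, e.g. surfacing every report that references the same unit identifier near the same geographic designator.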
Assessment Drafting
Using retrieved evidence, the system generates draft threat assessments with per-claim citations. Analysts review, edit, and approve at the sentence level — retaining full control while eliminating the mechanical assembly of evidence from multiple sources.
Gao et al.'s 2024 survey Retrieval-Augmented Generation for Large Language Models argues that task-grounded evaluation — where benchmarks are built from the actual workflows the system supports — is critical for validating RAG system performance. Threat Lens evaluation sets are therefore constructed from observed analyst workflows rather than generic question-answer pairs. Evaluation benchmarks for defense NLP tasks are available at defense-nlp-benchmarks.
Integration and Deployment
Threat Lens is designed for deployment on sovereign infrastructure. The system operates on-premises or in national cloud environments, ensuring that classified intelligence material does not transit foreign-hosted platforms.
The system is model-agnostic at the generation layer — it can integrate with any LLM that meets the deployment environment's security requirements. The retrieval layer, including the domain-tuned embedding model and vector database, operates independently of the generation model.
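Model-agnosticism amounts to a narrow interface between the retrieval and generation layers. A Python sketch of that seam, where the `GenerationModel` protocol and the `EchoModel` stand-in are hypothetical illustrations rather than the actual integration API:

```python
from typing import List, Protocol

class GenerationModel(Protocol):
    """Contract any approved LLM backend must satisfy."""
    def generate(self, prompt: str) -> str: ...

class EchoModel:
    """Stand-in backend used only to demonstrate the interface."""
    def generate(self, prompt: str) -> str:
        return "DRAFT: " + prompt.splitlines()[0]

def answer(query: str, chunks: List[str], model: GenerationModel) -> str:
    """Assemble retrieved chunks into a citation-constrained prompt and
    delegate generation to whichever backend the environment permits."""
    passages = "\n".join(f"[C{i}] {c}" for i, c in enumerate(chunks))
    prompt = f"Passages:\n{passages}\n\nQuestion: {query}\nCite every claim as [C<n>]."
    return model.generate(prompt)
```

Swapping the generation model means swapping the object passed in; the embedding model, vector database, and prompt assembly are untouched.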
> "The first step toward reliable AI-assisted analysis is ensuring the machine retrieves the right evidence. Everything downstream — summarization, report generation, decision support — inherits the accuracy of the retrieval layer." — GDIT, How Adaptive RAG Makes Generative AI More Reliable for Defense Missions, 2025
Comparison with Alternative Approaches
| Approach | Retrieval Accuracy | Provenance | Sovereignty | Scale |
|---|---|---|---|---|
| Threat Lens (domain-tuned RAG) | 94.2% | Sentence-level | Sovereign deployment | Team to enterprise |
| Frontier LLM API (e.g., GPT-4 via GenAI.mil) | ~87% | None (parametric generation) | U.S. cloud only | Enterprise |
| Defense platform RAG (e.g., Palantir AIP) | ~87–90% | Passage-level | U.S. cloud only | Enterprise |
| Manual analyst workflow | N/A (human judgment) | Full (human attribution) | Sovereign | Individual |