VISIMO
Evidence-grounded knowledge systems

Pelarion: AI-Assisted Evidence Review for Scientific Decision-Making

Pelarion is an AI-assisted evidence review system designed to help scientific and technical teams screen, structure, and inspect large bodies of academic and grey-literature sources while preserving provenance, reviewer control, and defensible decision trails.

Case study

The relevant public case-study content is organized here for readability and technical review.

Executive summary

VISIMO developed Pelarion to help scientific and technical reviewers work through fragmented evidence faster while preserving the source context and human judgment required for defensible review. The system is framed around review support, not reviewer replacement: it helps users screen, organize, and inspect evidence while keeping source provenance visible. In testing, the screening pipeline achieved 80% recall and 86% precision, giving reviewers a measurable way to prioritize work while retaining final decision authority.

Challenge

Scientific and technical review teams often work across large volumes of peer-reviewed literature, reports, PDFs, policy documents, technical memos, and grey literature. Traditional manual review can be slow, difficult to reproduce, and hard to audit when source quality, document structure, and relevance vary across a corpus. The challenge is not simply finding documents; it is screening them efficiently while preserving enough evidence to defend why each source was included, excluded, or flagged for further review.

  • Evidence is often scattered across many document types and repositories.
  • Manual screening can be slow and difficult to reproduce at scale.
  • Review teams need transparent inclusion, exclusion, and source-inspection workflows.

Problem requirements

The system needed to accelerate review without obscuring how evidence was found, why a source was prioritized, or where each output originated. Reviewer authority, source provenance, and defensible review artifacts were central requirements, but the tool also needed measurable screening performance so users could trust it as a review accelerator rather than a black-box shortcut.

  • Ingest and normalize technical documents and grey-literature PDFs.
  • Prioritize likely relevant sources while preserving human review decisions.
  • Create outputs that can be inspected, traced, exported, and defended.

Solution

Pelarion combines document ingestion, AI-assisted screening, semantic search, and reviewer workflow support to help users identify relevant sources, inspect evidence, and preserve a reviewable decision trail. The public framing emphasizes evidence-grounded AI: the system helps experts reason over sources rather than generating unsupported conclusions.

  • Grey-literature and technical-document ingestion.
  • AI-assisted relevance and evidence triage.
  • Source-linked outputs for expert review.
  • Page-level anchors and rationale outputs for auditability.

Implementation

VISIMO used a software-first prototype approach to turn a fragmented evidence-review workflow into a structured, inspectable tool. The delivered application combined PDF normalization, sentence-aware document chunking, title and abstract extraction, lexical ranking, supervised learning, full-text screening, and reviewer-facing explainability panels. The system was built, deployed, and alpha-tested as a working web application with asynchronous processing for long-running tasks such as ingestion, de-duplication, metadata extraction, and model-assisted screening.

Results

Pelarion produced concrete screening and extraction results. The screening pipeline reached 80% recall and 86% precision across test datasets, balancing the risk of missing relevant sources against the workload created by false positives. Replacing the original title-and-abstract extraction method with an LLM-based approach improved extraction performance by approximately 50% and successfully extracted titles and abstracts from nearly every uploaded document. A custom metadata extraction feasibility test across 32 biomedical articles identified population size in 96.77% of cases and matched population characteristics in 100% of test documents. These results demonstrate why the system matters: reviewers receive measurable prioritization, traceable rationales, and structured metadata without giving up human control.

  • Achieved 80% recall and 86% precision for model-assisted screening.
  • Improved title and abstract extraction performance by approximately 50%.
  • Demonstrated high-accuracy custom metadata extraction in a 32-article feasibility test.

Applications

Where the work can be applied or adapted.

Scientific and technical reviews

Help research teams screen and structure evidence across academic literature, reports, and grey literature.

Regulatory and policy evidence workflows

Preserve source context and review decisions for teams that need defensible evidence trails.

Mission knowledge management

Adapt the same architecture to acquisition, R&D, technical-library, or institutional-knowledge workflows.

Conclusion

Transition path

Pelarion demonstrates VISIMO's ability to build evidence-grounded AI systems that accelerate technical workflows without removing expert judgment. Its measured screening performance, extraction gains, and source-linked review design create a reusable foundation for scientific review, regulatory evidence, acquisition knowledge, and mission decision-support environments where the source trail matters as much as the answer.

Next step

Turn a mission question into a testable prototype.

VISIMO works with federal stakeholders, primes, research institutions, and technical collaborators on focused AI R&D efforts where software, data, and model evaluation can create practical mission value.

Decision support
AI assurance
Adaptive testing
Geospatial risk
Start the conversation