Evidence-Based Text Forensics Platform

Analyzing Content Authenticity Through Linguistic & Statistical Evidence

A forensic analysis system that evaluates textual evidence using multiple statistical, linguistic, and semantic signals to assess content authenticity across education, publishing, hiring, and research domains.

Low
False-Positive Bias (Domain-Calibrated)
6
Total Forensic Signals
10s
Average Processing Time

Why Choose Our Platform?

Advanced technology meets practical application

đŸŽ¯

Domain-Aware Analysis

Calibrated thresholds for Academic, Technical, Creative, and Casual content types with specialized analysis algorithms for each domain.

đŸ”Ŧ

6-Signal Evidence Ensemble

Combines perplexity, entropy, structural, linguistic, semantic, and perturbation-stability signals to form a multi-angle forensic evidence profile

💡

Explainable Results

Sentence-level highlighting with confidence scores and detailed forensic reasoning for each assessment.

🚀

Fast Processing

Analyze short texts in 1.2 seconds, medium documents in 3.5 seconds with parallel metric computation.

📄

Multi-Format Support

Upload and analyze TXT, PDF, DOCX, DOC, and Markdown files with automatic text extraction.

Forensic Signals Explained

Understanding the science behind the forensic evaluation

📊

Perplexity Weight: 25%

Measures how predictable the text is using reference language model. Model-generated or algorithmically assisted text typically exhibits lower perplexity (more predictable) than human writing, which tends to be more varied and surprising.

🎲

Entropy Weight: 20%

Calculates token-level diversity and unpredictability in text sequences. Human writing shows higher entropy with more varied word choices, while algorithmically generated text tends toward more uniform token distributions.

📈

Structural Analysis Weight: 15%

Analyzes sentence length variance, punctuation patterns, and lexical burstiness. Human writing exhibits more variation in sentence structure and rhythm compared to algorithmically generated text, which often shows more uniform patterns.

📝

Linguistic Analysis Weight: 15%

Evaluates POS tag diversity, syntactic complexity, and grammatical patterns. Examines the richness of language structures and whether they match natural human linguistic variation.

🧠

Semantic Analysis Weight: 15%

Assesses semantic coherence, repetition patterns, and contextual consistency. Identifies semantic consistency patterns that often differ between human-authored and algorithmically generated text.

🔍

Multi-Perturbation Stability Weight: 10%

Tests text stability under random perturbations. Algorithmically generated text tends to maintain higher likelihood scores even when slightly modified, while human text shows more variation.

Submit Content for Analysis

📄
Click to upload or drag and drop
Supported formats: TXT, PDF, DOCX, DOC, MD
Maximum file size: 10MB
Show suspicious sentences
More accurate but slower analysis
Show text analysis statistics

Analysis Report

✓

Ready for Analysis

Paste text or upload a document to begin evidence-based forensic analysis. Our multi-signal ensemble will provide detailed, explainable insights.

📝

Run an analysis to see sentence-level highlighting

📊

Run an analysis to see detailed metric breakdowns