REL_DOC: BLUEPRINT_TECH_SPEC_V0.8.8

TECHNICAL_SPECIFICATION

BLUEPRINT

ARCHITECTURE

A comprehensive technical overview of the Projekt Blueprint OSINT platform—from its agentic RAG system and entity extraction pipeline to its real-time streaming architecture and enterprise scalability patterns. Built for intelligence at scale.

SECTION_01

TECHNOLOGY STACK

CORE_INFRASTRUCTURE

FRAMEWORKNEXT.JS 14

LANGUAGETYPESCRIPT 5

DATABASEPOSTGRESQL 16

VECTOR_STOREPGVECTOR + HNSW

FULL_TEXT_SEARCHPOSTGRESQL GIN

ORMPRISMA 5

JOB_QUEUEBULLMQ + REDIS

REAL_TIMESSE (SERVER-SENT EVENTS)

AUTHENTICATIONNEXTAUTH.JS

DEPLOYMENTDOCKER COMPOSE

AI_INFERENCE_LAYER

Multi-Provider LLM

Hot-swappable LLM backends. Google Gemini, OpenAI GPT-4, Anthropic Claude, or local Ollama/LM Studio. Provider abstraction via unified interface.

Function Calling

Structured output via native function calling APIs. Gemini function-based entity extraction for reliable, typed data extraction.

Vector Embeddings

Document and entity embeddings for semantic similarity search. pgvector with HNSW indexing for O(log n) approximate nearest neighbor queries.

SCALABILITY_PROFILE

Entity Capacity:1,000,000+ with sub-ms queries

Batch Ingestion:40GB+ per session

Connection Pooling:PgBouncer transaction mode

Concurrent Workers:20 parallel extraction jobs

SECTION_02

AGENTIC RAG SYSTEM

The intelligence engine at Blueprint's core. A ReAct-based autonomous agent that reasons through complex queries using a suite of 24 specialized tools. Not just retrieval—true multi-step reasoning with tool orchestration.

REACT_LOOP_EXECUTOR

Thought → Action → Observation

The agent operates in a reasoning loop: it thinks about the query, selects and executes tools, observes results, and iterates until a complete answer emerges. Up to 100 iterations with 32K token context windows.

THOUGHT: Analyze user query, plan approach
ACTION: Select tool, construct parameters
OBSERVATION: Process results, update context
LOOP: Continue until answer is complete

TOOL_INVENTORY (24 TOOLS)

Search & Read9 TOOLS

Entity search, semantic search, event search, document search, RSS feeds, relations, alerts, workspace stats

Actions5 TOOLS

Create entity, update entity, add relations, create investigation, create alert

Multi-Hop Reasoning4 TOOLS

Batch entity lookup, batch relations, find common connections, execute workflow

Output & Analysis6 TOOLS

Generate report, list playbooks, get playbook, OSINT lookups (GreyNoise, VirusTotal), render canvas

AGENT_CANVAS_SYSTEM

Rich Visualization Rendering

The agent can render visualizations directly into a split-view canvas area. Seven canvas types support different analytical outputs. SSE streaming enables real-time canvas updates as the agent processes queries.

GRAPH

Force-directed networks

TABLE

Structured data grids

TIMELINE

Temporal sequences

MAP

Geospatial plots

METRICS

KPI dashboards

DOCUMENT

Formatted reports

SECTION_03

ENTITY EXTRACTION PIPELINE

Automated intelligence extraction from any data source. AI-powered entity recognition with fuzzy deduplication, alias resolution, and relationship inference. Production-grade queue architecture for high-throughput processing.

UNIFIED_EXTRACTION_QUEUE

BullMQ + Redis

Crash-resilient job processing

All extraction jobs—RSS items, documents, scrapes—flow through a unified BullMQ queue backed by Redis. 20 concurrent workers process jobs in parallel with 1800 RPM rate limiting to stay within LLM API quotas.

WORKERS: 20 concurrent
RATE_LIMIT: 1800 requests/minute
IDEMPOTENT: Skips already-processed items
RESILIENT: Survives restarts, resumes from position

CIRCUIT_BREAKERS

Fault Isolation

Per-source failure handling

Separate circuit breakers for RSS, documents, scrape, and manual sources prevent cascade failures. If one source type fails repeatedly, only that source pauses— other sources continue processing normally.

THRESHOLD

5 failures

COOLDOWN

30 seconds

HALF_OPEN

2 test requests

AUTO_RESET

On success

ENTITY_DEDUPLICATION

Smart Resolution

Fuzzy matching + cross-type detection

Prevents duplicate entities using Levenshtein distance scoring. Cross-type deduplication finds same-name entities classified differently. Three outcomes: LINK (matched), REVIEW (uncertain), CREATE (new entity).

ALGORITHM: Levenshtein distance + alias matching
THRESHOLD: ≥85% similarity for cross-type matches
PENALTY: -0.05 for type mismatch (reduced from -0.2)

SECTION_04

GUARDRAILS & SECURITY

Multi-layer safety controls for autonomous AI operation. Input validation, output filtering, human-in-the-loop approval, and comprehensive tracing. Responsible AI by design.

INPUT_GUARDRAILS

Injection Detection

→Prompt injection blocking
→SQL injection detection
→Query length limits (10K chars)

OUTPUT_GUARDRAILS

PII Filtering

→SSN redaction
→Credit card filtering
→Configurable filter types

ACTION_GUARDRAILS

Human-in-the-Loop

→Approval-required actions
→Rate limiting (30/min)
→Action expiration timeout

TRACING_SYSTEM

Full Observability

→10 span types tracked
→Timing metrics per span
→Full I/O capture

SECTION_05

DATA ARCHITECTURE

PostgreSQL at the core with specialized indexing strategies for different query patterns. Full-text search, vector similarity, and graph traversal—all in a single database.

FULL_TEXT_SEARCH

PostgreSQL GIN

tsvector-indexed text search across entity names, descriptions, and document content. Sub-50ms queries on 1M+ rows with proper GIN indexing.

Index type: GIN
Query time: <50ms at 1M rows

VECTOR_SIMILARITY

pgvector + HNSW

Semantic similarity search via vector embeddings. HNSW indexing provides O(log n) approximate nearest neighbor queries for fast semantic retrieval.

Index type: HNSW
Complexity: O(log n) ANN

GRAPH_RELATIONS

Relational + Traversal

Entity relationships stored in dedicated table with typed relations. Recursive CTEs enable multi-hop graph traversal up to 100 hops for connection discovery.

Max traversal: 100 hops
Relations/entity: 100 max

ENTERPRISE_INQUIRY

READY TO DEPLOY.

Blueprint is built for enterprise deployment. On-premise installation, private cloud options, and dedicated support available for qualified organizations.

REQUEST ENTERPRISE ACCESS ← BACK TO OVERVIEW

FOR PARTNERSHIP INQUIRIES: NICK@GRUPPEPROJEKT.COM

← RETURN_TO_OVERVIEW

BLUEPRINT_TECH_SPEC_V0.8.8 | CLASSIFICATION: PUBLIC | STATUS: OPERATIONAL