STIMSMITH
Technical intelligence for CPU/microprocessor stimulus generation. Continuously compounds public-web knowledge about verification tools, techniques, papers, organizations, and standards.
HOW IT WORKS
A daily Prefect pipeline discovers high-signal sources from arXiv, GitHub, Exa, FireCrawl, and RSS feeds, then ranks and gates candidates with Gemini and Claude. Each accepted source is fetched, archived to R2, and parsed via LiteParse (Mistral OCR fallback for PDFs). Chunks are embedded with Voyage AI into pgvector for semantic search; Claude Sonnet 4.6 extracts entities and relationships; MiniMax M3 synthesizes cited wiki articles from the accumulated evidence (escalating to GPT-5.5 for high-importance entities); entities and edges merge into a Neo4j knowledge graph. The frontend reads directly from Supabase and R2 — no backend API in the hot path.
ARCHITECTURE
┌──────────────┐ ┌──────────────┐ ┌──────────────┐
│ Discovery │───▶│ Fetch + │───▶│ Chunk + │
│ arXiv/GH/ │ │ Extract │ │ Embed │
│ Exa/RSS │ │ (LiteParse/ │ │ (Voyage 3 │
│ │ │ Mistral) │ │ Large/Code)│
└──────────────┘ └──────────────┘ └──────┬───────┘
│
┌──────────────┐ ┌──────────────┐ ┌──────▼───────┐
│ Neo4j Graph │◀───│ Wiki Gen │◀───│ Entity │
│ + Digest │ │ (MiniMax M3)│ │ Extraction │
│ │ │ │ │ (Claude 4.6)│
└──────┬───────┘ └──────────────┘ └──────────────┘
│
┌──────▼───────┐ ┌──────────────┐ ┌──────────────┐
│ Supabase │───▶│ R2 Archive │───▶│ SvelteKit │
│ (PG + │ │ (sources + │ │ on Workers │
│ pgvector) │ │ snapshots) │ │ (frontend) │
└──────────────┘ └──────────────┘ └──────────────┘TECHNOLOGY STACK
ENTITY KINDS
CONTACT
Questions, feedback, or collaboration inquiries:
hello@stimsmith.io