Anthropic, OpenAI, and Cursor each shipped an official skill or plugin directory this week, pulling the agent-skills scramble out of scattered GitHub gists and into vendor-curated catalogs. The five repositories below work the layer underneath and the layer on top, from where a model runs to what an agent finally ships.
Honcho
A FastAPI memory service for stateful agents. You store messages and events on per-peer sessions, and it reasons in the background to build queryable representations of users, agents, and groups over time. Unlike chunk-matching vector stores, it extracts conclusions. It runs as a managed API or self-hosted via Docker, with Python and TypeScript SDKs.
Stop Slop
A skill file that teaches Claude or any LLM to catch and remove AI writing patterns, including throat-clearing openers, banned phrases, em dashes, binary contrasts, passive voice, and metronomic rhythm. It ships SKILL.md plus reference lists and a 1-10 scoring rubric, loading them on demand inside Claude Code, Projects, or an API system prompt.
Get Implicator.ai in your inbox
Strategic AI news from San Francisco. No hype, no "AI will change everything" throat clearing. Just what moved, who won, and why it matters. Daily at 6am PST.
No spam. Unsubscribe anytime.
LiteParse
A Rust document parser from the LlamaIndex team that runs locally with no cloud calls. It extracts spatial text and bounding boxes from PDFs through PDFium, offers selective Tesseract or HTTP OCR, and generates page screenshots. The lit CLI plus Node, Python, and WASM bindings target developers feeding clean text into LLM pipelines.
bumblebee
Perplexity's read-only inventory collector reads on-disk lockfiles, package-manager metadata, editor and browser extension manifests, and MCP host configs on macOS and Linux developer machines. It emits structured NDJSON records and, given an exposure catalog, flags exact matches, answering which laptops show a named compromised package right now. It runs no package managers and reads no source.
DwarfStar
Written by Redis creator Salvatore Sanfilippo, DwarfStar is a self-contained C engine that runs DeepSeek V4 Flash on Apple Silicon Metal or CUDA, with a 512GB path for the larger PRO model. Unlike generic GGUF runners, it bundles model-specific loading, tool calling, an on-disk KV cache, an HTTP server, and a coding agent into one binary.
DwarfStar
Honcho, LiteParse, and bumblebee all assume the model itself runs somewhere else, usually behind a vendor API. DwarfStar inverts that assumption. Salvatore Sanfilippo, who created Redis, spent May writing a single-file C engine that runs DeepSeek V4 Flash on a 96GB personal machine, bundling tool calling, an on-disk KV cache, and an HTTP server into the binary. The README credits heavy GPT-5.5 assistance for the code, and the project already carries 122 open issues against 12,400 stars three weeks after its first commit on May 6.
Treat it as a benchmark of your own hardware rather than a production runtime. Clone it onto a high-memory Apple Silicon laptop or a CUDA box, pick the matching make target, pull Sanfilippo's quantized weights, and measure tokens per second and memory headroom on a real prompt against whatever API you pay for today. Success is a clear read on what a frontier-class open model actually costs to run in-house. The CPU path stays off the table until the macOS virtual-memory bug flagged in the README is fixed.
View DwarfStar on GitHub →Frequently Asked Questions
How were these projects selected?
Current GitHub metadata, recent activity, README clarity, practical setup path, and relevance to builders working with AI systems.
Are stars enough?
No. Stars measure attention. Push dates, license, issues, docs, and whether the project solves a specific workflow decide usefulness.
What does the difficulty score mean?
It estimates how hard the project is to test or adapt, not how impressive the underlying engineering is.
Which repo should readers try first?
Stop Slop is the easiest test, since it is a skill file with nothing to build. DwarfStar is the more strategic experiment for teams weighing local inference against vendor APIs.
What should teams check before production use?
License, data retention, credential access, update speed, maintainer responsiveness, and whether the repo has a realistic rollback path.
AI-generated summary, reviewed by an editor. More on our AI guidelines.



IMPLICATOR