Claude Directory

Best Claude Code Skills for LLM

19 skills tagged with “llm”

Reference for the Claude API / Anthropic SDK — model ids, pricing, params, streaming, tool use, MCP, agents, caching, token counting, model migration. TRIGGER — read BEFORE opening the target file; don't skip because it "looks like a one-liner" — whenever: the prompt names Claud…

Guide for creating high-quality MCP (Model Context Protocol) servers that enable LLMs to interact with external services through well-designed tools. Use when building MCP servers to integrate external APIs or services, whether in Python (FastMCP) or Node/TypeScript (MCP SDK).

python typescript node api +1

Senior ML Engineer

ML engineering skill for productionizing models, building MLOps pipelines, and integrating LLMs. Covers model deployment, feature stores, drift monitoring, RAG systems, and cost optimization. Use when the user asks about deploying ML models to production, setting up MLOps infras…

kubernetes docker performance deployment +5

Senior Prompt Engineer

Use when the user asks to optimize prompts, design prompt templates, evaluate LLM outputs with an eval set, measure RAG retrieval quality, validate agent/tool configurations, analyze token usage, or design structured-output contracts. Covers eval-driven prompt iteration, RAG met…

python ai llm rag +1

Multi-agent collaboration plugin that spawns N parallel subagents competing on the same task via git worktree isolation. Agents work independently, results are evaluated by metric or LLM judge, and the best branch is merged. Use when: user wants multiple approaches tried in para…

Evaluate and rank agent results by metric or LLM judge for an AgentHub session. Use when the user runs /hub:eval or asks to score, compare, or pick a winner among completed AgentHub agents.

Use when writing, reviewing, or committing code to enforce Karpathy's 4 coding principles — surface assumptions before coding, keep it simple, make surgical changes, define verifiable goals. Triggers on "review my diff", "check complexity", "am I overcomplicating this", "karpath…

LLM Cost Optimizer

Use proactively whenever LLM API costs come up -- or should. Triggers include: 'my AI costs are too high', 'optimize token usage', 'which model should I use', 'LLM spend is out of control', 'implement prompt caching', 'we're about to launch an AI feature', 'build me an AI endpoi…

Use when building or maintaining a persistent personal knowledge base (second brain) in Obsidian where an LLM incrementally ingests sources, updates entity/concept pages, maintains cross-references, and keeps a synthesis current. Triggers include "second brain", "Obsidian wiki",…

Prompt Governance

Use when managing prompts in production at scale: versioning prompts, running A/B tests on prompts, building prompt registries, preventing prompt regressions, or creating eval pipelines for production AI features. Triggers: 'manage prompts in production', 'prompt versioning', 'p…

Agent Workflow Designer

Design production-grade multi-agent workflows with clear pattern choice (sequential, parallel, hierarchical), handoff contracts, failure handling, and cost/context controls. Use when architecting a multi-step agent pipeline, choosing between single-agent vs multi-agent approache…

Use when the user asks to design a RAG pipeline, choose a chunking strategy or embedding model, pick a vector database, or evaluate retrieval quality (precision@k, recall@k, NDCG). Examples: 'design a RAG system for our docs', 'what chunk size should I use for this corpus', 'eva…

ai llm embedding rag +1

Pre-production audit that scans a codebase for security, database, deployment, code quality, AI/LLM, dependency, frontend, and observability issues. Intercepts deploy commands and blocks until critical items pass. Stack-agnostic. Use for "run ship gate", "am I ready to ship", "p…

go security deployment ai +1

Answer Engine Optimization (AEO) skill — optimize content to be cited by AI language models (ChatGPT, Perplexity, Claude, Gemini, Mistral) as authoritative sources. Distinct from SEO — AEO optimizes for citation in LLM-generated responses, not search rankings. Use when planning …

Prompt Engineer Toolkit

Turns marketing prompts into tested, versioned production assets: A/B prompt evaluation against structured test cases, immutable prompt version history with diffs, ready-to-use marketing prompt templates (ad copy, email campaigns, social posts, landing pages, SEO meta), and an L…

Cloudflare Agents

Build AI agents on Cloudflare Workers with MCP integration, tool use, and LLM providers.

cloudflare ai llm agent

Cloudflare Workers AI

Cloudflare Workers AI for serverless GPU inference. Use for LLMs, text/image generation, embeddings, or encountering AI_ERROR, rate limits, token exceeded errors.

cloudflare ai llm embedding

Firecrawl Scraper

Firecrawl v2.5 API for web scraping/crawling to LLM-ready markdown. Use for site extraction, dynamic content, or encountering JavaScript rendering, bot detection, content loading errors.

javascript scraping api llm

Build a regression + eval harness for AI-written code and AI features. Generates characterization tests that lock current behavior before a refactor, scaffolds a Promptfoo eval suite for chatbots/RAG/classifiers, and wires it into the ship-gate. Use when the user wants evals, re…
