# OpenClaw Implementation Prompts
Each prompt below is a self-contained brief you can hand to an AI coding assistant (or use as a project spec) to build that use case from scratch. Adapt the specific services to whatever you already use — the patterns are what matter.
---
## 1) Personal CRM Intelligence
```
Build me a personal CRM system that automatically tracks everyone I interact with, with smart filtering so it only adds real people — not newsletters, bots, or cold outreach.
Data sources:
- Connect to my email (Gmail API or IMAP) and scan the last 60 days of messages.
- Connect to my calendar (Google Calendar API) and scan the last 60 days of events.
- Run this ingestion on a daily cron schedule.
Contact extraction from email:
- Extract sender/recipient email addresses and names from messages.
- Estimate the number of exchanges (back-and-forth threads, not just raw message count): Math.min(Math.floor(totalMessages / 2), threadCount).
- Collect sample subject lines and message snippets for classification.
Contact extraction from calendar:
- Only include meetings with 1-10 attendees (skip large all-hands).
- Only include meetings at least 15 minutes long (skip quick check-ins that are really just reminders).
- Extract attendee names, emails, and the meeting title.
Filtering — this is critical. Most contacts from email are noise. Use a two-stage filter:
Stage 1 — Hard filters (always reject):
- My own email addresses and domains.
- Emails from family or personal contacts I've explicitly excluded (configurable list).
- Contacts already in the CRM or previously rejected.
- Generic role-based inboxes: info@, team@, partnerships@, collabs@, noreply@.
- Marketing/transactional senders matching patterns like a noreply@ local part or tx., cx., mail., email. domain prefixes.
Stage 2 — AI classification (use a fast, cheap LLM like Gemini Flash or Haiku):
Send the candidate's name, email, exchange count, and sample subject lines/snippets to an LLM with these rules:
- REJECT clearly automated or notification-only senders.
- REJECT if all sample subjects look like newsletters, digests, or automated reports ("weekly roundup", "monthly update", "AI news").
- REJECT cold outreach with low engagement — if exchanges are low relative to total emails, it's one-way pitching.
- REJECT if snippets show repetitive promotional content (product launches, feature announcements, affiliate reports).
- APPROVE only if it looks like a real person with genuine two-way interaction or a meaningful business relationship.
- Higher confidence for real back-and-forth conversations with varied, substantive topics.
Contact scoring (used for ranking, not filtering):
- Base score: 50
- +5 per email exchange (max +20)
- +3 per meeting (max +15)
- +15 if their title matches preferred titles (CEO, Founder, VP, Head of, Engineer, Partner, etc.)
- +10 if they appeared in small meetings (≤3 attendees)
- +10 if last interaction was within 7 days, +5 if within 30 days
- +25 bonus if the person appears in both email AND calendar (stronger signal)
- +10 if they have a recognizable role, +5 if they have a company
For each approved contact, store:
- Name, email(s), company, role/context
- Interaction timeline with dates
- Last-touch timestamp (auto-updated)
- Contact score
- Tags or categories
Learning system:
- Maintain a learning.json config with:
- skip_domains: domains to always reject (populated over time from rejections)
- prefer_titles: titles that boost contact scores
- skip_keywords: subject-line keywords that indicate spam
- min_exchanges: minimum exchange threshold (default 1)
- max_days_between: max days since last interaction (default 60)
- max_attendees: meeting size cap (default 10)
- min_duration_minutes: meeting length minimum (default 15)
- When I reject a contact, learn from it — add their domain to skip_domains if appropriate.
Deduplication:
- When a new contact is found, check by email, then by name+company combination.
- Merge records rather than creating duplicates.
Semantic retrieval:
- Generate embeddings for each contact record.
- Let me ask natural-language questions like:
- "Who did I meet from [company] last month?"
- "When did I last talk to [name]?"
- "Show contacts I haven't spoken to in 30+ days."
Storage: SQLite with WAL mode and foreign keys enabled.
Notifications: After each ingestion run, send a summary of new contacts, merges, rejections, and any issues.
```
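The scoring rules in this prompt reduce to a small pure function. Here's a minimal TypeScript sketch; the field names (`emailExchanges`, `smallMeetingCount`, and so on) are illustrative, not something the prompt prescribes:

```typescript
// Minimal sketch of the contact-scoring rules above. Field names are
// hypothetical; the title regex is one way to express the preferred-titles list.
interface ContactSignals {
  emailExchanges: number;
  meetingCount: number;
  title?: string;
  company?: string;
  smallMeetingCount: number;   // meetings with <= 3 attendees
  daysSinceLastTouch: number;
  inEmailAndCalendar: boolean;
}

const PREFERRED_TITLES = /ceo|founder|vp|head of|engineer|partner/i;

function scoreContact(c: ContactSignals): number {
  let score = 50;                                 // base score
  score += Math.min(c.emailExchanges * 5, 20);    // +5 per exchange, capped at +20
  score += Math.min(c.meetingCount * 3, 15);      // +3 per meeting, capped at +15
  if (c.title && PREFERRED_TITLES.test(c.title)) score += 15;
  if (c.smallMeetingCount > 0) score += 10;       // appeared in small meetings
  if (c.daysSinceLastTouch <= 7) score += 10;
  else if (c.daysSinceLastTouch <= 30) score += 5;
  if (c.inEmailAndCalendar) score += 25;          // strongest signal
  if (c.title) score += 10;                       // has a recognizable role
  if (c.company) score += 5;
  return score;
}
```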
---
## 2) Knowledge Base (RAG) — Save Anything, Recall It Later
```
Build me a personal knowledge base with RAG (retrieval-augmented generation).
Ingestion — I send a URL or file and the system saves it. It should handle:
- Web articles
- YouTube videos (transcripts)
- Tweets/X posts
- PDFs
- Plain text or notes
Source type detection: Determine the type from the URL pattern or file extension. Classify as: article, video, pdf, text, tweet, or other.
Content extraction with fallback chain — this is important because no single extractor works for every site:
1. For Twitter/X URLs:
a. Try FxTwitter API (api.fxtwitter.com) — free, no auth needed
b. Fall back to X API direct lookup
c. Fall back to web scraping
2. For YouTube URLs:
a. Pull transcript via YouTube transcript API or yt-dlp
3. For all other URLs (articles, blogs, etc.):
a. Try a clean text extractor (like Mozilla Readability or similar)
b. Fall back to Firecrawl or Apify for sites that block simple extraction
c. Fall back to a headless browser (Playwright/Puppeteer) for JavaScript-heavy pages
d. Last resort: raw HTTP fetch + HTML tag stripping
- Retry once on transient errors (ECONNRESET, ETIMEDOUT, DNS failures) with a 2-second delay.
Content quality validation — reject bad extractions:
- Minimum 20 characters of content.
- For articles/non-tweets: at least 15% of non-empty lines must be longer than 80 characters (to detect prose vs. navigation menus).
- Total content must be at least 500 characters for non-tweet types.
- Detect error pages by looking for 2+ signals: "access denied", "captcha", "please enable javascript", "cloudflare", "404", "sign in", "blocked", "rate limit".
- Maximum content length: 200,000 characters (truncate beyond this).
Deduplication — two layers:
1. URL-based: Normalize URLs before comparing — strip tracking params (utm_source, utm_medium, utm_campaign, fbclid, igshid, ref, s, t), remove www., normalize twitter.com to x.com, remove trailing slashes and fragments.
2. Content-hash: SHA-256 hash of the cleaned content. Store as a UNIQUE column — reject if the same hash already exists.
Chunking:
- Chunk size: 800 characters per chunk.
- Overlap: 200 characters between chunks.
- Minimum chunk size: 100 characters (append tiny remainders to the last chunk).
- Split on sentence boundaries (regex: /(?<=[.!?])\s+/).
Embedding generation:
- Use Google gemini-embedding-001 (768 dimensions, free) or OpenAI text-embedding-3-small (1536 dimensions) as fallback.
- Max input: 8000 characters per chunk.
- Process in batches of 10 chunks with 200ms delay between batches.
- Retry failed embeddings 3 times with exponential backoff (1s, 2s, 4s).
- Cache embeddings with an LRU cache (1000 entries).
Storage — SQLite with two tables:
- sources: id, url, title, source_type, summary, raw_content, content_hash (UNIQUE), tags (JSON array), created_at, updated_at
- chunks: id, source_id (FK), chunk_index, content, embedding (BLOB), embedding_dim, embedding_provider, embedding_model, created_at
- Index on chunks(source_id), sources(source_type), sources(content_hash).
- Enable WAL mode and foreign keys with CASCADE deletes.
Concurrency protection: Use a lock file to prevent simultaneous ingestion runs. Check if lock is stale (PID dead or file older than 15 minutes).
Retrieval — When I ask a question:
1. Embed my query using the same embedding provider.
2. Cosine similarity search over all stored chunks. Return top 10.
3. Deduplicate results: keep only the best chunk per source.
4. Sanitize content in results (max 2500 characters per excerpt).
5. Pass top chunks to an LLM: "Answer using only the provided context. Cite which sources you drew from."
6. Return the answer with source references.
```
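The chunking spec is concrete enough to sketch directly. A minimal TypeScript version of the 800/200/100 rules, assuming plain string input:

```typescript
// Minimal sketch of the chunking spec above: 800-char chunks, 200-char overlap,
// sentence-boundary splits, tiny remainders merged into the previous chunk.
function chunkText(
  text: string,
  size = 800,
  overlap = 200,
  minChunk = 100
): string[] {
  const sentences = text.split(/(?<=[.!?])\s+/);
  const chunks: string[] = [];
  let current = "";
  for (const sentence of sentences) {
    if (current.length > 0 && current.length + sentence.length + 1 > size) {
      chunks.push(current);
      // Carry the last `overlap` characters forward for context continuity.
      current = current.slice(-overlap);
    }
    current = current ? `${current} ${sentence}` : sentence;
  }
  if (current.length >= minChunk) {
    chunks.push(current);
  } else if (chunks.length > 0 && current.length > 0) {
    // Append tiny remainders to the last chunk instead of emitting them.
    chunks[chunks.length - 1] += ` ${current}`;
  } else if (current.length > 0) {
    chunks.push(current); // whole input was shorter than minChunk
  }
  return chunks;
}
```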
---
## 3) Content Idea Pipeline (Research → Dedupe → Project Management)
```
Build a content idea pipeline that researches topics, prevents duplicate ideas, and creates production-ready tasks.
Trigger: I describe a topic idea in chat.
Step 1 — Research:
- Search X/Twitter for recent discourse on the topic (use the tiered approach from Use Case #4 if built).
- Search my knowledge base (Use Case #2) for related saved content.
- Optionally run a web search.
Step 2 — Semantic dedupe (this is the critical gate):
- Maintain a database of all past content ideas with these fields:
- id (format: YYYY-MM-DD-NNN)
- date, type (short/long-form), title, slug (URL-friendly, UNIQUE)
- summary (the pitch text)
- tags (comma-separated)
- status: pitched, accepted, rejected, produced, duplicate
- response (my feedback)
- embedding (BLOB — generated via Gemini or OpenAI)
- created_at
- When a new idea comes in, run a hybrid similarity search:
- Semantic similarity (70% weight): Cosine similarity between the new idea's embedding and all stored embeddings.
- Keyword matching (30% weight): a weighted blend of matches against title (30%), summary (20%), and tags (20%).
- Combined score: (semantic × 0.7) + (keyword × 0.3)
- Hard gate: If any existing idea scores above 40% combined similarity, REJECT the new idea. Show me what it matched and the similarity score. No exceptions — this prevents re-pitching old concepts.
Step 3 — Brief assembly (only if it passes dedupe):
- Keep it short — a few sentences on the angle and why it's video-worthy.
- Do NOT generate titles, thumbnails, hooks, or scripts unless I explicitly ask.
- Include links to relevant tweets or KB sources from the research step.
Step 4 — Create task:
- Create a task in my project management tool (Asana, Linear, Notion, or Todoist) with the brief as the description.
Step 5 — Store the idea:
- Add the new idea + its embedding to the pitch database so future ideas are compared against it.
- Set initial status to "pitched".
Step 6 — Notify me with the brief and a link to the created task.
```
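The hybrid dedupe gate can be sketched as a pure scoring function. Note the prompt's keyword sub-weights (30/20/20) sum to 0.7, so this sketch normalizes them within the keyword component; that normalization is my assumption, since the prompt leaves it open:

```typescript
// Sketch of the hybrid dedupe gate. Inputs are precomputed 0..1 scores
// (cosine similarity and keyword-overlap scores produced elsewhere).
function combinedSimilarity(
  semantic: number,      // cosine similarity against a stored embedding
  titleMatch: number,
  summaryMatch: number,
  tagsMatch: number
): number {
  // Normalize the 30/20/20 sub-weights within the keyword component (assumption).
  const keyword =
    (titleMatch * 0.3 + summaryMatch * 0.2 + tagsMatch * 0.2) / 0.7;
  return semantic * 0.7 + keyword * 0.3;
}

function isDuplicate(scoresVsExistingIdeas: number[]): boolean {
  // Hard gate: any existing idea above 40% combined similarity rejects the pitch.
  return scoresVsExistingIdeas.some((s) => s > 0.4);
}
```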
---
## 4) Social Media Research System (Cost-Optimized)
```
Build a social media research tool that answers "What are people saying about [topic]?" using X/Twitter, while minimizing API costs.
Query decomposition:
- Take my question and break it into 2-4 focused search queries covering different angles.
Tiered retrieval — always try the cheapest option first:
- Tier 1 (free): FxTwitter API (api.fxtwitter.com) — works for individual tweet lookups, no auth needed.
- Tier 2 (low-cost, ~$0.15/1K tweets): A paid Twitter data provider like TwitterAPI.io or SocialData. Supports search, profiles, user tweets, and thread context.
- Tier 3 (expensive, ~$0.005/tweet): Official X API v2 (api.x.com/2). Last resort only. Throttle requests to respect rate limits (the 450 req/15 min search limit works out to roughly one request every 2 seconds).
How the tiers cascade by operation:
- Single tweet lookup: Tier 1 → Tier 2 → Tier 3
- Search: Tier 2 → Tier 3
- Profile lookup: Tier 2 → Tier 3
- Thread expansion: Tier 2 → Tier 3
Filtering:
- Filter by timeframe (default: last 7 days, configurable).
- Rank by engagement (likes + retweets + replies).
- Remove retweets and duplicate content.
- Suppress low-quality or spam results.
Thread expansion: For high-engagement tweets that are part of a thread, pull the full thread.
Caching: Cache results with a 1-hour TTL so repeated queries don't burn API credits.
Usage logging: Log every API call to a separate log file per tier, with timestamps and estimated costs.
Output — synthesize into a briefing:
- Key narratives: 3-5 dominant takes
- Notable posts: 5-10 most relevant tweets with links
- Sentiment summary: positive, negative, or mixed
- Contrarian takes: interesting minority opinions
```
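The cascade pattern is the same for every operation: walk the tiers cheapest-first and stop at the first success. A minimal sketch, where the per-provider adapters you pass in (e.g. FxTwitter, a paid provider, the official API) are hypothetical functions you'd implement against each service:

```typescript
// Sketch of the tier cascade: try each provider in order, fall through on failure.
interface Tweet { id: string; text: string; likes: number; }
type TweetFetcher = (id: string) => Promise<Tweet | null>;

async function lookupTweet(
  id: string,
  tiers: TweetFetcher[]   // cheapest first, e.g. [fxTwitter, paidProvider, officialApi]
): Promise<Tweet | null> {
  for (const fetchTier of tiers) {
    try {
      const tweet = await fetchTier(id);
      if (tweet) return tweet;  // stop at the first tier that succeeds
    } catch {
      // Fall through to the next (more expensive) tier.
    }
  }
  return null;
}
```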
---
## 5) YouTube Analytics + Competitor Tracking
```
Build a YouTube analytics system that tracks my channel daily and monitors competitors.
My channel — collect these metrics daily via YouTube Data API and YouTube Analytics API:
- views (total and per-video)
- estimatedMinutesWatched (watch time)
- averageViewDuration
- subscribersGained
- impressions and CTR (if available via Analytics API)
Store each day's metrics in a SQLite table:
- daily_stats: video_id, date, views, watch_time_minutes, avg_view_duration, impressions, ctr, subscribers_gained
- videos: id, title, published_at, thumbnail_url, duration_seconds
Derived metrics:
- 7-day moving average for daily views
- Subscriber-to-view conversion rate
- Views-per-video trend
Competitor tracking:
- Accept a configurable list of competitor channel IDs/usernames.
- Daily, pull each competitor's recent uploads (title, publish date, views) and subscriber count via YouTube Data API.
- Track upload cadence and momentum.
Chart generation — produce PNG charts using matplotlib (dark theme):
1. "trend" chart: Daily view trend for a specific video (line chart with fill area)
2. "top" chart: Top N videos by views (horizontal bar chart, color-coded by short vs long-form)
3. "daily" chart: Channel-wide daily views with 7-day moving average overlay
- Format numbers with commas, dates as readable labels, include grid and legends.
- Save charts to a designated output folder.
Schedule: Daily cron job collecting all data. Optional daily summary sent to chat.
```
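The 7-day moving average is the only derived metric with any subtlety: the window has to truncate at the start of the series. A minimal sketch:

```typescript
// Sketch of the 7-day moving average used for the daily-views overlay.
// Each point averages the current day and up to six days before it.
function movingAverage(dailyViews: number[], window = 7): number[] {
  return dailyViews.map((_, i) => {
    const start = Math.max(0, i - window + 1);
    const slice = dailyViews.slice(start, i + 1);
    return slice.reduce((sum, v) => sum + v, 0) / slice.length;
  });
}
```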
---
## 6) Nightly Business Briefing (Multi-Perspective AI Council)
```
Build a nightly business analysis system that collects signals from across my tools and produces prioritized recommendations using a multi-persona AI review council.
Signal collection — pull data from whatever systems you use. Normalize each signal into:
{ source, signal_name, value, confidence (0-100), direction (up/down/flat), category }
Example sources: YouTube metrics, CRM health, project management backlog, social growth, email themes, meeting action items, sales pipeline, operations/cron reliability.
Compact to the top 200 signals by confidence score, with up to 4 sample events per source.
Three-phase AI review council (all phases use a frontier model like Claude Opus or GPT-4):
Phase 1 — Draft (LeadAnalyst):
- Score business outlook on weekly and monthly horizons (0-100), plus a blended average.
- Generate 5-10 recommendations, each with: title, description (2-3 plain sentences), evidence references, estimated impact (0-100), estimated effort (0-100), confidence (0-100).
- Constraints: use only provided data, no made-up facts, plain human language, don't recommend publishing content immediately.
Phase 2 — Parallel review (4 personas, run simultaneously via Promise.all):
1. GrowthStrategist: Scalable growth, asymmetric upside. Demands concrete evidence.
2. RevenueGuardian: Near-term revenue and cash flow. Challenges anything that doesn't protect income.
3. SkepticalOperator: Execution reality, data quality risks. Challenges weak assumptions.
4. TeamDynamicsArchitect: Team health, collaboration quality, sustainable pace.
Each reviewer:
- Critiques every recommendation: support, revise, or reject (with score adjustments)
- Can propose new recommendations the draft missed
- Returns: reviewer_name, key_findings[], votes[] (with adjusted scores), new_recommendations[]
Phase 3 — Consensus (CouncilModerator):
- Receives draft + all 4 reviews.
- Reconciles disagreements.
- Produces final recommendation set with consensus notes.
Fallback: If consensus fails, use draft recommendations. If a reviewer fails, substitute a stub review with the error message.
Ranking — score each recommendation:
priority = (impact × w1) + (confidence × w2) + ((100 - effort) × w3)
Default weights: w1=0.4, w2=0.35, w3=0.25. Store weights in a policy table and update via feedback over time.
Filter: Remove any "publish now" recommendations (hard constraint).
Output: Deliver ranked recommendations to my preferred channel. Store full council trace (draft, all reviews, consensus, models used) in a database for auditing and backtesting.
Schedule: Run nightly via cron.
```
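The ranking step is a direct translation of the formula. A sketch with the default weights inlined; in practice the prompt wants them loaded from the policy table:

```typescript
// Sketch of the ranking formula:
// priority = impact*w1 + confidence*w2 + (100 - effort)*w3
interface Recommendation {
  title: string;
  impact: number;      // 0-100
  effort: number;      // 0-100
  confidence: number;  // 0-100
}

const weights = { w1: 0.4, w2: 0.35, w3: 0.25 }; // from the policy table in practice

function rankRecommendations(recs: Recommendation[]): Recommendation[] {
  const priority = (r: Recommendation) =>
    r.impact * weights.w1 +
    r.confidence * weights.w2 +
    (100 - r.effort) * weights.w3;
  return [...recs].sort((a, b) => priority(b) - priority(a));
}
```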
---
## 7) CRM/Business Tool Natural Language Access
```
Build a natural-language interface to my CRM (HubSpot, Salesforce, or whatever I use).
Supported objects and operations:
- Contacts: search, create, update, list, get by ID
- Companies: search, create, update, list
- Deals: search, create, update, list, query by pipeline stage
- Owners: list
- Associations: link contacts to companies, deals to contacts, deals to companies
(For HubSpot, common association type IDs: 1=Contact→Company, 3=Deal→Contact, 5=Deal→Company)
- Properties/schema: inspect available fields for any object
Intent classification — parse my message into:
1. Lookup: "Find the contact for [name]" → search contacts
2. Create: "Add a new contact: [name], [email], [company]" → create contact
3. Update: "Update [contact]'s title to [new title]" → update contact
4. List: "Show all deals in negotiation stage" → filter deals
5. Associate: "Link [contact] to [company]" → create association
Validation: If I'm missing required fields, ask me before proceeding.
Response format: Return clean, readable summaries — not raw JSON. For deal queries, include: deal name, amount, stage, close date, last modified.
Authentication: API key or OAuth stored in environment variables.
```
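A discriminated union is a natural target shape for the intent classifier, since each intent carries different fields. A sketch with illustrative field names:

```typescript
// Sketch of the intent shapes the classifier should emit. A discriminated
// union keeps downstream handlers type-safe; the field names are illustrative.
type CrmIntent =
  | { kind: "lookup"; object: "contact" | "company" | "deal"; query: string }
  | { kind: "create"; object: "contact"; fields: Record<string, string> }
  | { kind: "update"; object: "contact"; id: string; fields: Record<string, string> }
  | { kind: "list"; object: "deal"; filters: { stage?: string } }
  | { kind: "associate"; fromId: string; toId: string; associationTypeId: number };

// e.g. "Link Jane to Acme" might classify as (HubSpot type 1 = Contact→Company):
const example: CrmIntent = {
  kind: "associate",
  fromId: "contact-123",
  toId: "company-456",
  associationTypeId: 1,
};
```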
---
## 8) AI Content Humanization
```
Build a text rewriting tool that removes AI-generated artifacts and makes content sound like a real person wrote it.
Input: I paste draft text (social posts, blog paragraphs, emails, video scripts).
Step 1 — Detection. Scan for common AI tells:
- Overuse of: "delve", "landscape", "leverage", "it's important to note", "in conclusion", "game-changing", "revolutionary", "transformative"
- Tone inflation: dramatic language when the subject doesn't warrant it
- Generic phrasing that could apply to any topic
- Repetitive sentence structures (every sentence starting the same way)
- Excessive hedging: "It's worth noting that perhaps..."
- Lists that feel generated (too clean, too parallel, no personality)
- Identical paragraph lengths and rhythms
Step 2 — Rewrite:
- Replace vague qualifiers with specific, concrete language
- Vary sentence length (mix short punchy sentences with longer ones)
- Use contractions, sentence fragments, and informal word choices where natural
- Remove filler while keeping the core message
- Add the kind of imperfections real writing has — not errors, but human cadence
Step 3 — Channel tuning (optional, based on destination):
- Twitter/X: Punchy, under 280 chars, direct, no filler
- LinkedIn: Professional but conversational, authoritative without being stiff
- Blog: Longer form, personal anecdotes and opinions welcome
- Email: Brief, clear, action-oriented, warm but efficient
Output: Return the revised text. Optionally highlight what changed and why.
```
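Step 1's phrase scan is simple enough to sketch. This counts occurrences of the listed tells so the rewrite step knows what to target; the phrase list is taken verbatim from the prompt:

```typescript
// Sketch of the Step 1 detector: count occurrences of known AI tells.
const AI_TELLS = [
  "delve", "landscape", "leverage", "it's important to note",
  "in conclusion", "game-changing", "revolutionary", "transformative",
];

function detectAiTells(text: string): Record<string, number> {
  const lower = text.toLowerCase();
  const hits: Record<string, number> = {};
  for (const tell of AI_TELLS) {
    const count = lower.split(tell).length - 1; // non-overlapping occurrences
    if (count > 0) hits[tell] = count;
  }
  return hits;
}
```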
---
## 9) Image Generation + Iterative Editing
```
Build an image generation workflow where I describe what I need in chat and iterate until it's right.
Core flow:
1. I describe an image: subject, style, composition, mood, colors, any text to include.
2. Generate 1-3 variants using an image generation API (DALL-E 3, Midjourney API, Stable Diffusion, or Flux).
3. I review and either:
a. Accept → save as final asset
b. Request changes → I describe adjustments ("darker background", "remove text", "more minimal") and it generates new variants incorporating my feedback
4. Loop continues until I'm satisfied.
Image editing:
- I can send an existing image and ask for edits: swap background, adjust composition, combine multiple images.
- Use inpainting or img2img when available.
Context tracking: Remember what we've been working on across multiple messages in the same session so I don't have to re-describe the whole concept each iteration.
Output: Save final assets to a designated folder. Deliver as downloadable files in chat.
```
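The context tracking amounts to folding accumulated feedback back into the base concept so each iteration builds on the last. A minimal sketch of that session state, with illustrative shapes:

```typescript
// Sketch of per-session context for iterative image generation.
interface ImageSession {
  concept: string;     // original description, given once
  feedback: string[];  // accumulated change requests
}

function nextPrompt(session: ImageSession, newFeedback: string): string {
  session.feedback.push(newFeedback);
  // Fold every change request so far into the base concept,
  // so the user never has to re-describe the whole idea.
  return `${session.concept}. Adjustments: ${session.feedback.join("; ")}`;
}
```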
---
## 10) Task Management from Meetings + Chat
```
Build a system that extracts action items from meetings and conversations, gets my approval, then creates tasks in my task manager.
Input sources:
- Meeting transcripts from Fathom, Otter.ai, Fireflies, or any transcript provider
- Notes I paste into chat
- Direct commands: "Add a task to follow up with [person] by Friday"
Extraction — use an LLM (Gemini Flash or similar fast model) to parse transcripts and extract:
- description: what needs to be done (max 150 characters, clear and actionable)
- assignee: who is responsible (name as it appears in the transcript)
- is_owner: boolean — is this MY action item or someone else's?
- todoist_title: clean task title (max 120 chars) — only generate this for items where is_owner is true
- Do NOT extract vague items like "think about X" — only concrete actions with a clear next step.
CRM cross-reference (if you have Use Case #1 built):
- Match mentioned people against CRM contacts to enrich with company/role context.
Approval flow — this is important, don't auto-create tasks:
1. Show me the extracted task list with each item numbered.
2. Let me choose: "all", "none", or specific items by number (e.g. "1, 3, 5").
3. Let me edit any task before confirming (change title, priority, due date).
4. Only after I approve, create the tasks via the Todoist/Asana/Linear API.
5. If any task creation fails, keep it in a pending state for retry.
Direct task creation:
- When I say "remind me to [thing] by [date]", skip extraction and create directly after confirming.
Output: Send confirmation with links to created tasks. Persist approval state so I can check what's pending.
Default project: Configure a default project/list for tasks (e.g. "Work" or my company name).
```
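The approval reply ("all", "none", or a list of numbers) needs deterministic parsing before any API call. A sketch, with the item shape mirroring the extraction fields:

```typescript
// Sketch of the approval step: parse "all", "none", or "1, 3, 5" into the
// subset of extracted items that should actually become tasks.
interface ExtractedTask {
  description: string;    // max 150 chars
  assignee: string;
  is_owner: boolean;
  todoist_title?: string; // only present when is_owner is true
}

function selectApproved(items: ExtractedTask[], reply: string): ExtractedTask[] {
  const normalized = reply.trim().toLowerCase();
  if (normalized === "all") return items;
  if (normalized === "none") return [];
  const indices = normalized
    .split(/[,\s]+/)
    .map(Number)
    .filter((n) => Number.isInteger(n) && n >= 1 && n <= items.length);
  return indices.map((n) => items[n - 1]); // user-facing list is 1-indexed
}
```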
---
## 11) AI Usage and Cost Tracking
```
Build a logging and reporting system that tracks every AI API call and provides cost visibility.
Logging — wrap every LLM/AI API call with a logger that records this structure (one JSON object per line in a JSONL file):
{
"timestamp": "ISO8601",
"model": "model-name",
"tokens": { "input": number, "output": number, "total": number },
"taskType": "string (e.g. crm-ingestion, x-research, image-gen, business-analysis)",
"description": "string (brief note on what this call was for)",
"costEstimate": number,
"source": "tracker-name"
}
Cost calculation — use a pricing lookup table (per 1M tokens, input/output):
- Anthropic: Opus ($15/$75), Sonnet ($3/$15), Haiku ($0.80/$4)
- OpenAI: GPT-4 ($30/$60), GPT-4 Turbo ($10/$30), GPT-3.5 Turbo ($0.50/$1.50), o1 ($15/$60)
- Google: Gemini Pro ($10/$30), Gemini Flash ($0.30/$1.20), Gemini 1.5 Pro ($1.25/$5)
- xAI: Grok ($2/$10)
- Default for unknown models: $1/$3
- Formula: (inputTokens / 1M) × inputPrice + (outputTokens / 1M) × outputPrice
- Keep this table in a config file so it's easy to update when pricing changes.
Storage: Append-only JSONL file. Optionally mirror to SQLite for faster querying.
Reporting — generate on demand with filters (--days N, --model X, --task-type Y):
1. Overall summary: total calls, total tokens (input/output/total), total cost
2. By model: calls, tokens, cost — sorted by cost descending
3. By task type: calls, tokens, cost — sorted by cost descending
4. By day: last 10 days showing calls, tokens, and cost per day
5. Trend: daily or weekly spending over the last 30/90 days
Routing suggestions:
- Flag any task type where a frontier model (Opus, GPT-4) is being used but the task is simple (short inputs, formatting-only, data extraction) — suggest downgrading to a cheaper model.
- Flag workflows that account for >25% of total spend as optimization candidates.
- Suggest caching strategies for repeated queries (especially research workflows).
Output: Markdown report or simple web dashboard. Optionally send weekly cost summary to my chat.
```
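The cost formula and pricing table translate directly. A sketch with the table inlined for brevity (the prompt suggests keeping it in a config file):

```typescript
// Sketch of the cost formula with the pricing table from the prompt
// (USD per 1M tokens, [input, output]).
const PRICING: Record<string, [number, number]> = {
  opus: [15, 75], sonnet: [3, 15], haiku: [0.8, 4],
  "gpt-4": [30, 60], "gpt-4-turbo": [10, 30], "gpt-3.5-turbo": [0.5, 1.5],
  o1: [15, 60], "gemini-pro": [10, 30], "gemini-flash": [0.3, 1.2],
  "gemini-1.5-pro": [1.25, 5], grok: [2, 10],
};
const DEFAULT_PRICE: [number, number] = [1, 3]; // unknown models

function estimateCost(
  model: string,
  inputTokens: number,
  outputTokens: number
): number {
  const [inPrice, outPrice] = PRICING[model.toLowerCase()] ?? DEFAULT_PRICE;
  return (inputTokens / 1_000_000) * inPrice + (outputTokens / 1_000_000) * outPrice;
}
```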