<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Ayaan Motiwala — Blog</title>
    <link>https://ayaanmotiwala.com/blog</link>
    <description>AI Specialist from Surat building production multi-LLM systems, voice calling agents, and automations that survive real users. Book a consultation.</description>
    <language>en</language>
    <atom:link href="https://ayaanmotiwala.com/rss.xml" rel="self" type="application/rss+xml"/>
    <item>
      <title>How to Build a Production-Ready Multi-LLM System: A 2026 Architecture Guide</title>
      <link>https://ayaanmotiwala.com/blog/multi-llm-system-architecture</link>
      <guid>https://ayaanmotiwala.com/blog/multi-llm-system-architecture</guid>
      <pubDate>Mon, 15 Jun 2026 00:00:00 GMT</pubDate>
      <description>A deep architecture guide to multi-LLM systems — model routing, fallbacks, cost instrumentation, and caching — from someone who runs these in production and cut a client's model bill 40–60%.</description>
      <category>AI</category>
    </item>
    <item>
      <title>Building an AI Voice Calling Agent: A Complete 2026 Walkthrough</title>
      <link>https://ayaanmotiwala.com/blog/ai-voice-calling-agent-walkthrough</link>
      <guid>https://ayaanmotiwala.com/blog/ai-voice-calling-agent-walkthrough</guid>
      <pubDate>Mon, 08 Jun 2026 00:00:00 GMT</pubDate>
      <description>How to build an AI voice calling agent that holds real phone conversations — the STT to LLM to TTS pipeline, sub-second latency, interruption handling, and clean human handoff. Built from a live production system.</description>
      <category>Voice AI</category>
    </item>
    <item>
      <title>RAG Explained: Building Retrieval-Augmented Generation with LangChain</title>
      <link>https://ayaanmotiwala.com/blog/langchain-rag-tutorial</link>
      <guid>https://ayaanmotiwala.com/blog/langchain-rag-tutorial</guid>
      <pubDate>Thu, 28 May 2026 00:00:00 GMT</pubDate>
      <description>A practical LangChain RAG tutorial that goes past the demo — chunking strategy, embedding choice, hybrid search, evaluation, and the source-citation grounding that keeps a chatbot from making things up.</description>
      <category>AI</category>
    </item>
    <item>
      <title>n8n vs Make: Which Automation Platform Should You Use for AI Workflows in 2026?</title>
      <link>https://ayaanmotiwala.com/blog/n8n-vs-make-ai-workflows</link>
      <guid>https://ayaanmotiwala.com/blog/n8n-vs-make-ai-workflows</guid>
      <pubDate>Fri, 15 May 2026 00:00:00 GMT</pubDate>
      <description>A hands-on n8n vs Make comparison for AI automation — pricing, AI nodes, self-hosting, error handling, and which one I actually reach for on client builds, with a clear decision rule.</description>
      <category>Automations</category>
    </item>
    <item>
      <title>FastAPI for AI Apps: Serving LLMs in Production Without the 2am Pages</title>
      <link>https://ayaanmotiwala.com/blog/fastapi-llm-deployment</link>
      <guid>https://ayaanmotiwala.com/blog/fastapi-llm-deployment</guid>
      <pubDate>Sat, 02 May 2026 00:00:00 GMT</pubDate>
      <description>How to serve LLMs in production with FastAPI — async streaming endpoints, auth, rate limiting, caching, and observability. The production scaffolding I rebuilt one too many times, explained.</description>
      <category>AI</category>
    </item>
    <item>
      <title>Choosing a Vector Database in 2026: pgvector vs Pinecone vs Chroma</title>
      <link>https://ayaanmotiwala.com/blog/vector-database-comparison</link>
      <guid>https://ayaanmotiwala.com/blog/vector-database-comparison</guid>
      <pubDate>Mon, 20 Apr 2026 00:00:00 GMT</pubDate>
      <description>A practical vector database comparison for RAG — pgvector vs Pinecone vs Chroma on cost, scale, ops, and filtering. Which one I default to, when I switch, and the decision rule I use on client builds.</description>
      <category>AI</category>
    </item>
  </channel>
</rss>