What is a Knowledge Base?
A Knowledge Base in Pathlit is a searchable collection of documents (text, PDFs, web pages, and more) that are chunked, embedded, and stored in a vector index. You use knowledge bases to:- Power RAG (Retrieval Augmented Generation) — Retrieve relevant chunks for a query and pass them as context to an LLM Node so answers are grounded in your content.
- Semantic search — Find documents or passages by meaning, not just keywords.
- Centralize content — Keep product docs, support articles, internal wikis, or scraped web content in one place and query them from workflows.
Knowledge bases are available on plans that include the Knowledge Bases entitlement. You create and manage them in the Pathlit app (e.g. Knowledge Base or Document Index section in your organization).
Creating a Knowledge Base
Create a knowledge base from the Pathlit app (e.g. Knowledge Base page in your organization). You will be asked for:- Name — A short name (letters, numbers, underscores, spaces). Used to identify the index in the app and in workflow nodes.
- Description — Optional short description of what the knowledge base is for.
- Embedding model — Model used to compute dense embeddings for chunks. Options typically include:
- text-embedding-3-small — Fast and cost-effective (default).
- text-embedding-3-large — Higher quality, larger dimension.
- text-embedding-ada-002 — Legacy option.
- Sparse embedding model (optional) — If set, you can use Sparse or Hybrid retrieval when searching (e.g. in the Agent Node Knowledge Base Search tool or the Knowledge Base Search Node) for better recall on keyword-heavy queries. Options include BM25 and Splade PP English.
Adding Content
You can add content in several ways:1. Upload files (app or API)
From the Pathlit app, open the knowledge base and upload files (PDF, Word, text, and other supported types). Files are chunked and embedded automatically. You can also use the ingest-files API to upload documents programmatically.2. Google Drive sync (app or API)
Sync files from Google Drive into the knowledge base. The app/API lets you configure which folders or files to index. Content is processed and embedded like uploaded files.3. Website scraping (app or API)
Scrape a website to index its pages into the knowledge base. Useful for documentation sites, blogs, or public pages. Configure the URL and crawl options; scraped content is chunked and embedded.4. Knowledge Base Add Node (workflows)
Use the Knowledge Base Add Node inside a workflow to add plain text to a knowledge base. The node accepts:- Knowledge Base — The target index (from the dropdown).
- File Content — The text to add (supports format strings from workflow input, e.g.
{llm_output}). - Optional File Name Prefix and Wait for indexing to complete.
- Writing summaries or extracted data from an LLM or Document Data node into a knowledge base.
- Indexing customer feedback or support tickets from previous steps.
- Building a knowledge base dynamically from workflow outputs.
Searching a Knowledge Base
Recommended: Agent Node with Knowledge Base Search tool Use the Agent Node with the Knowledge Base Search tool enabled and your knowledge base selected. The agent receives the user prompt (e.g. a question) and can call the knowledge base search tool when it needs to look up information—so it decides when to search, how to phrase queries, and how to combine results with other tools (e.g. web search). This is the preferred approach for RAG chatbots, Q&A, and flexible workflows.- Add an Agent Node to your workflow.
- In the node’s Tools configuration, enable Knowledge Base Search and select your knowledge base.
- Provide a prompt (e.g. Answer the user’s question using the knowledge base. If the answer isn’t in the knowledge base, say so.). The agent will call the search tool as needed and use the retrieved chunks in its response.
RAG workflow example
Recommended: Agent with Knowledge Base Search tool- Trigger (e.g. chatbot, webhook, or form) captures a user question.
- Agent Node — Enable the Knowledge Base Search tool and select your knowledge base. Prompt: Answer the user’s question using the knowledge base. Only use information from the search results; if the answer isn’t there, say so. The agent will search the knowledge base when needed and respond with grounded answers.
- Trigger captures a user question.
- Knowledge Base Search Node — Search String
{user_question}; output = retrieved chunks. - LLM Node — System prompt: Answer using only the following context. If the answer is not in the context, say so. User prompt: include the retrieved chunks and the user question.
Summary
| Step | Where | What |
|---|---|---|
| Create | Pathlit app → Knowledge Base | Name, description, embedding model, optional sparse model |
| Add content | App (upload / GDrive / website) or Knowledge Base Add node | Files, URLs, or plain text |
| Search | Recommended: Agent node with Knowledge Base Search tool | Agent decides when/how to search and combines with other tools |
| Search (alternative) | Knowledge Base Search node | Single fixed search step → retrieved chunks (and optional citations) |
| Use in RAG | Agent (with KB tool) or LLM node | Agent uses tool results; or pass KB Search node output as context to LLM |