๐Ÿ“Š Enhanced RAG CSV Chatbot

Upload CSV files and chat with your data using powerful local language models

๐Ÿ“ Step 1 - Configure and Initialize RAG Pipeline

Text Splitting Strategy

How to split CSV data into chunks

Vector Database

Vector storage backend

Chunk Size

Size of text chunks for processing

๐Ÿค– Step 2 - Configure and Load LLM

Available Models

Choose the language model (will be loaded locally with 4-bit quantization)

0.1 1
128 512
1 5

๐Ÿ’ฌ Step 3 - Chat with Your CSV Data

Top 3 most relevant sources from your CSV data:

๐Ÿ’ก Tips for Better Results

  • Start simple: Ask basic questions first (e.g., "What columns are in this data?")
  • Be specific: Ask about particular columns or values (e.g., "What are the highest sales values?")
  • Request summaries: Ask for overviews (e.g., "Summarize the key patterns in this dataset")
  • Compare data: Ask comparative questions (e.g., "Compare category A vs category B")
  • Be patient: Local models may take 30-60 seconds to load and respond
  • Check sources: Review the source references to understand what data informed the answer

๐Ÿ”ง Model Information

Available Models:

  • Qwen2.5-7B-Instruct: Advanced Chinese-English bilingual model, excellent for analysis
  • Llama-3.1-8B-Instruct: Meta's powerful instruction-following model

Note: Models are loaded locally with 4-bit quantization for memory efficiency. First load may take several minutes.

๐Ÿ”ง Troubleshooting

If you get errors:

  • Wait for model loading to complete (check progress messages)
  • Ensure sufficient GPU/RAM memory (models use ~4-6GB)
  • Try simpler questions if responses are incomplete
  • Make sure your CSV files have clear column headers
  • Recreate the database if issues persist