Having Conversations
Once a model is loaded, you can start chatting.Sending Messages
- Type your message in the input box
- Press Enter or click Send
- Wait for the model to respond
- Continue the conversation
Conversation Context
The chat maintains conversation history:- Each message you send includes previous context
- The model “remembers” what you’ve discussed
- Longer conversations use more memory
Context Window
Models have a maximum context length:| Model | Context Length |
|---|---|
| Llama 3.2 (1B/3B) | 128K tokens |
| Llama 3.1 | 128K tokens |
| Mistral 7B v0.3 | 32K tokens |
| Gemma 2 | 8K tokens |
| Qwen 2.5 | 128K tokens |
Context lengths vary by model version. Check the model card on Hugging Face for exact specifications.
Conversation Tips
For Testing Fine-tuned Models
Test with prompts similar to your training data:For Evaluating Quality
Ask questions that reveal model capabilities:- Factual: “What is the capital of France?”
- Reasoning: “If A > B and B > C, is A > C?”
- Creative: “Write a haiku about programming”
- Domain-specific: Questions from your fine-tuning domain
For Finding Issues
Test edge cases:- Very short inputs (“Hi”)
- Very long inputs
- Unusual characters or formatting
- Questions outside training domain
- Attempts to confuse the model
Clearing History
To start fresh:- Look for “Clear” or “New Chat” button
- Or reload the page
- Testing different scenarios
- Context gets too long
- Starting a new demo