AI Model Stack
Vercel AI SDK Integration
import { StreamingTextResponse, LangChainStream } from 'ai';
import { KimiAdapter } from './adapters/kimi-adapter';
// Core implementation
const { stream, handlers } = LangChainStream();
const model = new KimiAdapter({
model: 'kimi-k2-thinking',
temperature: 0.7,
maxTokens: 8000,
tools: [
'tavily-search',
'file-extract',
'image-analyze',
'calculator'
]
});
return new StreamingTextResponse(
model.streamPrompt(prompt, {
...handlers,
onToolCall: handleToolInvocation
})
);Available Models
Kimi K2 Thinking
PrimaryMoonshot AI's advanced reasoning model
Context:200K tokens
Use:Deep reasoning
Huggingface Mixtral
SecondaryFast inference for simpler tasks
Context:32K tokens
Use:Quick queries
Vision Models
SpecializedImage analysis and OCR
- • microsoft/trocr-large-printed (OCR)
- • Salesforce/blip-image-captioning-base
- • facebook/detr-resnet-50 (object detection)
Context Management
StrategySliding window with 160K token context
PersistenceTemporary (session-based)
SummarizationAutomatic when approaching limits
Cost OptimizationSmart truncation of old messages