In this article, we add RAG as a tool call to gpt-oss-chat where we let the assistant decide when to search a user provided document via Qdrant in-memory DB. ...
RAG Tool Call for gpt-oss-chat
In this article, we add RAG as a tool call to gpt-oss-chat where we let the assistant decide when to search a user provided document via Qdrant in-memory DB. ...
In this article, we add web search tool call to gpt-oss-chat CLI mode. We cover the definition of tools, how to handle streaming with tool call, and other caveats. ...
In this article, we work on gpt-oss-chat. A local user friendly chat interface powered by gpt-oss-20b, with local RAG and web search capabilities. ...
In this article, we create a simple SAM 3 Gradio UI for image and video segmentation. SAM 3 UI supports segmenting objects belonging to the different categories while using less than 10GB VRAM. ...
In this article, we explore the gpt-oss model card and run inference with gpt-oss-20b using llama.cpp locally. ...
Business WordPress Theme copyright 2025