DebuggerCafe - Deep Learning, Machine Learning, Artificial Intelligence

Building a RAG Application with Nemotron 3 Nano Omni

In this article, we build a RAG application using the Nemotron 3 Nano Omni mode. The model is deployed on Modal and the frontend is powered by Gradio. ...

Getting Started with NVIDIA LocateAnything

Sovit Ranjan Rath July 27, 2026 0 Comment

In this article, we explore the NVIDIA latest VLM, LocateAnything, which is capable of object detection, object pointing, text detection & OCR, and GUI grounding. ...

Deploying Nemotron 3 Nano Omni on Modal Serverless

Sovit Ranjan Rath July 20, 2026 0 Comments

In this article, we are deploying the NVIDIA Nemotron 3 Nano Omni model on the Modal Serverless L40S GPU for text, image, and video understanding chat. ...

Introduction to NVIDIA Nemotron 3 Nano Omni

Sovit Ranjan Rath July 13, 2026 2 Comments

In this article, we cover an introduction to the latest NVIDIA Nemotron 3 Nano Omni model and create a simple chat application using the NVIDIA API. ...

Fine-Tuning PaliGemma 2 for Object Detection

Sovit Ranjan Rath July 6, 2026 0 Comment

In this article, we fine-tune the PaliGemma 2 model for object detection. We specifically tune the model for wheat head detection. ...