DebuggerCafe - Deep Learning, Machine Learning, Artificial Intelligence

Getting Started with NVIDIA LocateAnything

In this article, we explore the NVIDIA latest VLM, LocateAnything, which is capable of object detection, object pointing, text detection & OCR, and GUI grounding. ...

Deploying Nemotron 3 Nano Omni on Modal Serverless

Sovit Ranjan Rath July 20, 2026 0 Comments

In this article, we are deploying the NVIDIA Nemotron 3 Nano Omni model on the Modal Serverless L40S GPU for text, image, and video understanding chat. ...

Introduction to NVIDIA Nemotron 3 Nano Omni

Sovit Ranjan Rath July 13, 2026 2 Comments

In this article, we cover an introduction to the latest NVIDIA Nemotron 3 Nano Omni model and create a simple chat application using the NVIDIA API. ...

Fine-Tuning PaliGemma 2 for Object Detection

Sovit Ranjan Rath July 6, 2026 0 Comment

In this article, we fine-tune the PaliGemma 2 model for object detection. We specifically tune the model for wheat head detection. ...

Gemma 4 Text Fine-Tuning

Sovit Ranjan Rath June 29, 2026 0 Comment

In this article, we are fine-tuning Gemma 4 for text on a reasoning dataset whose responses have been collected from DeepSeek-V4 Flash model. ...