Skip to content
DebuggerCafe

Machine Learning and Deep Learning

  • ABOUT
  • CONTACT
  • DCHUB
  • DebuggerCafe
  • Privacy Policy
  • Projects
  • Topics
Close Menu

Grounding Qwen3-VL Detection with SAM2

Sovit Ranjan RathSovit Ranjan Rath January 5, 2026January 5, 2026 0 Comment
Grounding Qwen3-VL Detection with SAM2

In this article, are grounding the Qwen3-VL object detection capabilities with SAM2 segmentation. The pipeline uses Qwen3-VL to detect objects via natural language whose coordinates are then fed to the SAM2 model for segmentation. ...

Read MoreRead More

Fine-Tuning Qwen3-VL

Sovit Ranjan RathSovit Ranjan Rath December 29, 2025December 29, 2025 0 Comment
Fine-Tuning Qwen3-VL

In this article, we are fine-tuning the Qwen3-VL 2B model for sketch and image to HTML. After fine-tuning, we will be able to feed an image of a website to the model and get the HTML code for it. ...

Read MoreRead More

Creating a Sketch to HTML Application with Qwen3-VL

Sovit Ranjan RathSovit Ranjan Rath December 22, 2025December 22, 2025 0 Comment
Creating a Sketch to Image Application with Qwen3-VL

In this article, we explore creating a simple sketch to HTML application using Qwen3-VL where users can upload an image or screenshot for a potential website and the Qwen3-VL model will give back the HTML. ...

Read MoreRead More

Introduction to Qwen3-VL

Sovit Ranjan RathSovit Ranjan Rath December 15, 2025December 15, 2025 2 Comments
Introduction to Qwen3-VL

In this article, we explore the Qwen3-VL model, the latest iteration of the Qwen-VL series. We start with model architecture and benchmarks, and then move to hands-on inference for object detection, OCR, video understanding, and sketch-to-HTML using Qwen3-VL. ...

Read MoreRead More

Fine-Tuning Phi-3.5 Vision Instruct

Sovit Ranjan RathSovit Ranjan Rath December 8, 2025December 8, 2025 0 Comment
Fine-Tuning Phi-3.5 Vision Instruct

In this article we are fine-tuning the Phi-3.5 Vision Instruct model on a receipt OCR dataset. We are using Hugging Face libraries and training a LoRA. ...

Read MoreRead More

Posts pagination

Previous page Page 1 … Page 3 Page 4 Page 5 … Page 78 Next page

Subscribe

* indicates required

Categories

Recent Posts

  • Getting Started with GLM-4.6V
  • Fine-Tuning DeepSeek-OCR 2
  • Understanding DeepSeek-OCR 2
  • DeepSeek-OCR 2 Inference and Gradio Application
  • Multi-Turn Tool Call with gpt-oss-chat

Pages

  • ABOUT
  • CONTACT
  • DCHUB
  • DebuggerCafe
  • Privacy Policy
  • Projects
  • Topics

Reach out

  • Facebook
  • LinkedIn
  • Twitter

Business WordPress Theme copyright 2025

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

DebuggerCafe
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.