Skip to content
DebuggerCafe

Machine Learning and Deep Learning

  • ABOUT
  • CONTACT
  • DCHUB
  • DebuggerCafe
  • Privacy Policy
  • Projects
  • Topics
Close Menu

Semantic Segmentation using Web-DINO

Sovit Ranjan RathSovit Ranjan Rath June 30, 2025June 30, 2025 0 Comment
Semantic Segmentation using Web-DINO

In this article, we are modifying the Web-DINO 300M architecture for semantic segmentation. We will add a simple segmentation decoder head and train the model for person segmentation. ...

Read MoreRead More

Image Classification with Web-DINO

Sovit Ranjan RathSovit Ranjan Rath June 23, 2025June 23, 2025 0 Comments
Image Classification with Web-DINO

In this article we use the Web-DINO model for image classification. We modify the Web-DINO 300M model, adding a classification head on top, freezing the backbone, and training on cotton disease classification task. ...

Read MoreRead More

Web-SSL: Scaling Language Free Visual Representation

Sovit Ranjan RathSovit Ranjan Rath June 16, 2025June 16, 2025 0 Comments
Web-SSL: Scaling Language Free Visual Representation

In this article, we explore the Web-DINO models trained via Web-SSL 2.0 methodology on the MC-2B (MetaCLIP-2B) dataset. ...

Read MoreRead More

Getting Started with SmolVLM2 – Code Inference

Sovit Ranjan RathSovit Ranjan Rath June 9, 2025June 9, 2025 0 Comment
Getting Started with SmolVLM2 – Code Inference

In this article, we cover inference code for SmolVLM2. We carry out image and video inference experiments using SmolVLM2-2.2B-Instruct and SmolVLM2-256M-Instruct. ...

Read MoreRead More

Qwen2.5-Omni: An Introduction

Sovit Ranjan RathSovit Ranjan Rath June 2, 2025June 2, 2025 0 Comment
Qwen2.5-Omni: An Introduction

In this article, we explore Qwen2.5-Omni, a multimodal generative AI model that can accept text, image, video, and audio as inputs while outputting both text and audio. ...

Read MoreRead More

Posts pagination

Previous page Page 1 … Page 7 Page 8 Page 9 … Page 77 Next page

Subscribe

* indicates required

Categories

Recent Posts

  • gpt-oss-chat Local RAG and Web Search
  • SAM 3 UI – Image, Video, and Multi-Object Inference
  • gpt-oss Inference with llama.cpp
  • SAM 3 Inference and Paper Explanation
  • Hunyuan3D 2.0 – Explanation and Runpod Docker Image

Pages

  • ABOUT
  • CONTACT
  • DCHUB
  • DebuggerCafe
  • Privacy Policy
  • Projects
  • Topics

Reach out

  • Facebook
  • LinkedIn
  • Twitter

Business WordPress Theme copyright 2025

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

DebuggerCafe
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.