DebuggerCafe - Deep Learning, Machine Learning, Artificial Intelligence

Image-to-3D: Incremental Optimizations for VRAM, Multi-Mesh Output, and UI Improvements

In this article we carry out optimizations for the image-to-3D pipeline in terms of VRAM usage, multi-object generation from prompts, and improved UI. The pipeline uses Qwen3-VL, BiRefNet, and Hunyuan3D models. ...

Image-to-Texture Generation for 3D Meshes

Sovit Ranjan Rath January 19, 2026 2 Comments

In this article, we cover image-to-texture for 3D meshes using Hunyuan3D, Qwen3-VL, and BiRefNet models ...

Image to 3D Mesh Generation with Detection Grounding

Sovit Ranjan Rath January 12, 2026 2 Comments

In this article we create a simple, yet robust pipeline for image to 3D mesh generation with detection grounding using Qwen3-VL, BiRefNet, and Hunyuan3D 2.0 model. ...

Grounding Qwen3-VL Detection with SAM2

Sovit Ranjan Rath January 5, 2026 0 Comment

In this article, are grounding the Qwen3-VL object detection capabilities with SAM2 segmentation. The pipeline uses Qwen3-VL to detect objects via natural language whose coordinates are then fed to the SAM2 model for segmentation. ...

Fine-Tuning Qwen3-VL

Sovit Ranjan Rath December 29, 2025 0 Comment

In this article, we are fine-tuning the Qwen3-VL 2B model for sketch and image to HTML. After fine-tuning, we will be able to feed an image of a website to the model and get the HTML code for it. ...