DebuggerCafe - Deep Learning, Machine Learning, Artificial Intelligence

Fine-Tuning Gemma 3n for Speech Transcription

In this article, we are fine-tuning Gemma 3n for German speech transcription using the Unsloth library and running evaluations before and after training. ...

Multimodal Gradio App with Together AI

Sovit Ranjan Rath October 6, 2025 0 Comment

In this article, we create a multimodal Gradio application with Together AI models for chatting LLMs & VLMs, generating images, and automatic speech transcription using OpenAI Whisper models. ...

Serverless Inference with Together AI

Sovit Ranjan Rath September 29, 2025 0 Comments

In this article, we explore Together AI, a serverless generative AI platform for text generation, vision language models, image generation, and more. ...

Background Replacement Using BiRefNet

Sovit Ranjan Rath September 22, 2025 0 Comment

In this article, we create a background replacement application using BiRefNet. We cover the code using Jupyter Notebook and create a Gradio application as well. ...

Introduction to BiRefNet

Sovit Ranjan Rath September 15, 2025 2 Comments

In this article, we explore the BiRefNet model for high-resolution dichotomous segmentation. Along with discussing the key elements of the paper, we also create a small background removal codebase usign the pretrained model. ...