DebuggerCafe - Deep Learning, Machine Learning, Artificial Intelligence

Training Gemma 3n for Transcription and Translation

In this article, we are training the Gemma 3n model for transcription and translation of German audio files to English using the Unsloth library and creating a Gradio application also. ...

Fine-Tuning Gemma 3n for Speech Transcription

Sovit Ranjan Rath October 13, 2025 0 Comments

In this article, we are fine-tuning Gemma 3n for German speech transcription using the Unsloth library and running evaluations before and after training. ...

Multimodal Gradio App with Together AI

Sovit Ranjan Rath October 6, 2025 0 Comment

In this article, we create a multimodal Gradio application with Together AI models for chatting LLMs & VLMs, generating images, and automatic speech transcription using OpenAI Whisper models. ...

Serverless Inference with Together AI

Sovit Ranjan Rath September 29, 2025 0 Comments

In this article, we explore Together AI, a serverless generative AI platform for text generation, vision language models, image generation, and more. ...

Background Replacement Using BiRefNet

Sovit Ranjan Rath September 22, 2025 0 Comment

In this article, we create a background replacement application using BiRefNet. We cover the code using Jupyter Notebook and create a Gradio application as well. ...