Skip to content
DebuggerCafe

Machine Learning and Deep Learning

  • ABOUT
  • CONTACT
  • DCHUB
  • DebuggerCafe
  • Privacy Policy
  • Projects
  • Topics
Close Menu

Multimodal RAG with Phi 3.5

Sovit Ranjan RathSovit Ranjan Rath November 4, 2024November 4, 2024 0 Comment
Multimodal RAG with Phi 3.5

In this article, we create a multimodal RAG application from scratch to chat with PDFs, text files, images, and videos using the Phi-3.5 family of language models. ...

Read MoreRead More

Phi-3.5 Vision: Multi-Turn Multimodal Chat with Images and Videos

Sovit Ranjan RathSovit Ranjan Rath October 28, 2024October 28, 2024 0 Comments
Phi-3-5 Vision Multi-Turn Multimodal Chat with Images and Videos

In this article, we create a multimodal chat interface with Gradio to chat with images and videos using Phi-3.5 Vision Instruct model. ...

Read MoreRead More

Serving LLMs using LitServe

Sovit Ranjan RathSovit Ranjan Rath October 21, 2024October 21, 2024 0 Comment
Serving LLMs using LitServe

In this article, we use LitGPT, LitAPI, and LitServe for serving LLMs using Lightning Studio and also on the local system. ...

Read MoreRead More

Torchvision Backbones for DeepLab Segmentation

Sovit Ranjan RathSovit Ranjan Rath October 14, 2024October 14, 2024 2 Comments
Torchvision Backbones for DeepLab Segmentation

In this article, we use different Torchvision backbones for creating DeepLab segmentation models and train it on the Pascal VOC semantic segmentation dataset. ...

Read MoreRead More

Adding Models to Ollama

Sovit Ranjan RathSovit Ranjan Rath October 7, 2024October 7, 2024 0 Comment
Adding Models to Ollama

In this article, we are adding a custom model to Ollama. We take a fine-tuned Hugging Face model, StarCoder2-3B, convert it to GGUF format, add it to local Ollama, and push the model to Ollama hub. ...

Read MoreRead More

Posts pagination

Previous page Page 1 … Page 6 Page 7 Page 8 … Page 69 Next page

Subscribe

* indicates required

Categories

Recent Posts

  • Qwen2.5-Omni: An Introduction
  • Fine-Tuning SmolVLM for Receipt OCR
  • Gemma 3 – Advancing Open, Lightweight, Multimodal AI
  • SmolVLM: Accessible Image Captioning with Small Vision Language Model
  • Gradio Application using Qwen2.5-VL

Pages

  • ABOUT
  • CONTACT
  • DCHUB
  • DebuggerCafe
  • Privacy Policy
  • Projects
  • Topics

Reach out

  • Facebook
  • LinkedIn
  • Twitter

Business WordPress Theme copyright 2025

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

DebuggerCafe
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.