Skip to content
DebuggerCafe

Machine Learning and Deep Learning

  • ABOUT
  • CONTACT
  • DCHUB
  • DebuggerCafe
  • Privacy Policy
  • Projects
  • Topics
Close Menu

Fine Tuning Phi 1.5 using QLoRA

Sovit Ranjan RathSovit Ranjan Rath May 20, 2024May 20, 2024 15 Comments
Fine Tuning Phi 1.5 using QLoRA

In this article, we are fine tuning the Phi 1.5 model using QLoRA on the Stanford Alpaca dataset with Hugging Face Transformers. ...

Read MoreRead More

Hugging Face Autotrain – Getting Started

Sovit Ranjan RathSovit Ranjan Rath May 13, 2024May 13, 2024 3 Comments
Hugging Face Autotrain Getting Started

In this article, we use the Hugging Face Autotrain no code platform to train the GPT2 Large model for following instructions. ...

Read MoreRead More

Instruction Tuning GPT2 on Alpaca Dataset

Sovit Ranjan RathSovit Ranjan Rath May 6, 2024May 6, 2024 6 Comments
Instruction Tuning GPT2 on Alpaca Dataset

In this article, we are instruction tuning the GPT2 Base model on the Alpaca dataset. We use the Hugging Face Transformers library along with the SFT Trainer Pipeline for this. ...

Read MoreRead More

Instruction Tuning OPT-125M

Sovit Ranjan RathSovit Ranjan Rath April 29, 2024April 29, 2024 7 Comments
Instruction Tuning OPT-125M

In this article, we carry out instruction tuning of the OPT-125M model by training it on the Open Assistant Guanaco dataset using the Hugging Face Transformers library. ...

Read MoreRead More

Fine-Tuning GPT2 for Text Generation

Sovit Ranjan RathSovit Ranjan Rath April 22, 2024April 22, 2024 3 Comments
Fine-Tuning GPT2 for Text Generation

In this article, we train the DistilGPT2 model for detective story generation. We use the Hugging Face Transformers library to fine-tune the model on Arthur Conan Doyle's collection of Sherlock Holmes stories. ...

Read MoreRead More

Posts pagination

Previous page Page 1 … Page 11 Page 12 Page 13 … Page 69 Next page

Subscribe

* indicates required

Categories

Recent Posts

  • Getting Started with SmolVLM2 – Code Inference
  • Qwen2.5-Omni: An Introduction
  • Fine-Tuning SmolVLM for Receipt OCR
  • Gemma 3 – Advancing Open, Lightweight, Multimodal AI
  • SmolVLM: Accessible Image Captioning with Small Vision Language Model

Pages

  • ABOUT
  • CONTACT
  • DCHUB
  • DebuggerCafe
  • Privacy Policy
  • Projects
  • Topics

Reach out

  • Facebook
  • LinkedIn
  • Twitter

Business WordPress Theme copyright 2025

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

DebuggerCafe
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.