DebuggerCafe

Machine Learning and Deep Learning

Human Action Recognition using 2D CNN with PyTorch

Sovit Ranjan Rath | June 12, 2023 | 0 Comments

Human action recognition using a 2D CNN: specifically, fine-tuning a pretrained ResNet50 model on a dataset with 15 action classes. ...

Using Custom Backbone for PyTorch SSD for Object Detection

Sovit Ranjan Rath | June 5, 2023 | 52 Comments

In this article, we use a PyTorch SSD model with a custom ResNet34 backbone and train it on a person detection dataset. ...

Train DETR on Custom Dataset

Sovit Ranjan Rath | May 29, 2023 | 2 Comments

In this article, we fine-tune DETR (Detection Transformer) models on a custom aquarium dataset. After training, we also run inference on the test dataset and unseen videos. ...

DETR for Object Detection

Sovit Ranjan Rath | May 22, 2023 | 9 Comments

In this article, we explore the DETR model for object detection. Along with a short discussion of the model architecture, we also carry out inference on videos. ...

Train PyTorch RetinaNet on Custom Dataset

Sovit Ranjan Rath | May 15, 2023 | 29 Comments

In this article, we train the PyTorch RetinaNet model on a custom BCCD dataset. We go through the configuration of the RetinaNet model for custom training, train the model, and also carry out inference. ...

Recent Posts

  • SmolVLM: Accessible Image Captioning with Small Vision Language Model
  • Gradio Application using Qwen2.5-VL
  • Qwen2.5-VL: Architecture, Benchmarks and Inference
  • Phi-4 Mini and Phi-4 Multimodal
  • ViTPose – Human Pose Estimation with Vision Transformer
