Machine Learning
Real Machine Learning — simple, practical, and built on experience.
Learn step by step with clear explanations and working code.

Admin: @HusseinSheikho || @Hussein_Sheikho
Training YOLOv12 for Detecting Pothole Severity Using a Custom Dataset

📖 Table of Contents: Introduction · Dataset and Task Overview · About the Dataset · What Are We Detecting? · Defining Pothole Severity · Can the Pothole Severity Logic Be Improved? · Configuring Your Development Environment · Training…...

🏷️ #ComputerVision #DeepLearning #ObjectDetection #Tutorial #YOLO
Sharpen Your Vision: Super-Resolution of CCTV Images Using Hugging Face Diffusers

📖 Table of Contents: Configuring Your Development Environment · Problem Statement · How Does Super-Resolution Solve This? · State-of-the-Art Approaches · Generative Adversarial Networks (GANs) · Diffusion Models · Implementing Diffus...

🏷️ #ArtificialIntelligence #ComputerVision #DeepLearning #ImageProcessing #MachineLearning #Tutorial
Unlocking Image Clarity: A Comprehensive Guide to Super-Resolution Techniques

📖 Table of Contents: Introduction · Configuring Your Development Environment · Need Help Configuring Your Development Environment? · What Is Super-Resolution? · Usual Problems with Low-Resolution Imagery · Traditional Computer Vision A...

🏷️ #ArtificialIntelligence #ComputerVision #DeepLearning #ImageProcessing #MachineLearning #TechnologyApplications #Tutorial
CycleGAN: Unpaired Image-to-Image Translation (Part 1)

📖 Table of Contents: Introduction · Unpaired Image Translation · CycleGAN Pipeline and Training · Loss Formulation · Adversarial Loss · Cycle Consistency · Summary · Citation Information

In this tutorial, yo...

🏷️ #ComputerVision #CycleGAN #DeepLearning #Keras #KerasandTensorFlow #TensorFlow #UnpairedImageTranslation
People Tracker with YOLOv12 and Centroid Tracker

📖 Table of Contents: Introduction · Why People Tracker Monitoring Matters · How YOLOv12 Enables Real-Time Applications · Configuring Your Development Environment · Downloading the Input Video · Install gdown · Download the Video · Visualizing the Inference and Trackin...

🏷️ #ComputerVision #ObjectDetection #PeopleTracker #Tutorial #YOLOv12
Meet BLIP: The Vision-Language Model Powering Image Captioning

📖 Table of Contents: What Is Image Captioning and Why Is It Challenging? · Why It’s Challenging · Why Traditional Vision Tasks Aren’t Enough · Configuring Your Development Environment · A Brief History of Image Captioning Models…...

🏷️ #ComputerVision #DeepLearning #ImageCaptioning #MultimodalAI #Tutorial
🤖🧠 Thinking with Camera 2.0: A Powerful Multimodal Model for Camera-Centric Understanding and Generation

🗓️ 14 Oct 2025
📚 AI News & Trends

In the rapidly evolving field of multimodal AI, bridging the gaps between vision, language, and geometry is one of the frontier challenges. Traditional vision-language models excel at describing what is in an image (“a cat on a sofa,” “a red car on the road”) but struggle to reason about how the image was captured: the camera’s ...

#MultimodalAI #CameraCentricUnderstanding #VisionLanguageModels #AIResearch #ComputerVision #GenerativeModels