Data Science Jupyter Notebooks
11.1K subscribers
269 photos
31 videos
9 files
727 links
Explore the world of Data Science through Jupyter Notebooksโ€”insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
Download Telegram
๐Ÿš€ New Tutorial: Automatic Number Plate Recognition (ANPR) with YOLOv11 + GPT-4o-mini!


This hands-on tutorial shows you how to combine the real-time detection power of YOLOv11 with the language understanding of GPT-4o-mini to build a smart, high-accuracy ANPR system! From setup to smart prompt engineering, everything is covered step-by-step. ๐Ÿš—๐Ÿ’ก

๐ŸŽฏ Key Highlights:
โœ… YOLOv11 + GPT-4o-mini = High-precision number plate recognition
โœ… Real-time video processing in Google Colab
โœ… Smart prompt engineering for enhanced OCR performance

๐Ÿ“ข A must-watch if you're into computer vision, deep learning, or OpenAI integrations!


๐Ÿ”— Colab Notebook
โ–ถ๏ธ Watch on YouTube


#YOLOv11 #GPT4o #OpenAI #ANPR #OCR #ComputerVision #DeepLearning #AI #DataScience #Python #Ultralytics #MachineLearning #Colab #NumberPlateRecognition

๐Ÿ” By : https://t.iss.one/DataScienceN
๐Ÿ‘2โค1๐Ÿ”ฅ1
๐Ÿ“š JaidedAI/EasyOCR โ€” an open-source Python library for Optical Character Recognition (OCR) that's easy to use and supports over 80 languages out of the box.

### ๐Ÿ” Key Features:

๐Ÿ”ธ Extracts text from images and scanned documents โ€” including handwritten notes and unusual fonts
๐Ÿ”ธ Supports a wide range of languages like English, Russian, Chinese, Arabic, and more
๐Ÿ”ธ Built on PyTorch โ€” uses modern deep learning models (not the old-school Tesseract)
๐Ÿ”ธ Simple to integrate into your Python projects

### โœ… Example Usage:

import easyocr

reader = easyocr.Reader(['en', 'ru']) # Choose supported languages
result = reader.readtext('image.png')


### ๐Ÿ“Œ Ideal For:

โœ… Text extraction from photos, scans, and documents
โœ… Embedding OCR capabilities in apps (e.g. automated data entry)

๐Ÿ”— GitHub: https://github.com/JaidedAI/EasyOCR

๐Ÿ‘‰ Follow us for more: @DataScienceN

#Python #OCR #MachineLearning #ComputerVision #EasyOCR
โค2๐Ÿ”ฅ1
๐Ÿ”ฅ Trending Repository: awesome-deep-text-detection-recognition

๐Ÿ“ Description: A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

๐Ÿ”— Repository URL: https://github.com/hwalsuklee/awesome-deep-text-detection-recognition

๐Ÿ“– Readme: https://github.com/hwalsuklee/awesome-deep-text-detection-recognition#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 2.5K stars
๐Ÿ‘€ Watchers: 148
๐Ÿด Forks: 508 forks

๐Ÿ’ป Programming Languages: Not available

๐Ÿท๏ธ Related Topics:
#ocr #deep_learning #text_recognition #awesome_list #text_detection #ocr_recognition #awesome_lists #text_detection_recognition #ocr_detection #ocr_papers #ocr_paper #ocr_paper_list


==================================
๐Ÿง  By: https://t.iss.one/DataScienceN
โค1
๐Ÿ”ฅ Trending Repository: Umi-OCR

๐Ÿ“ Description: OCR software, free and offline. ๅผ€ๆบใ€ๅ…่ดน็š„็ฆป็บฟOCR่ฝฏไปถใ€‚ๆ”ฏๆŒๆˆชๅฑ/ๆ‰น้‡ๅฏผๅ…ฅๅ›พ็‰‡๏ผŒPDFๆ–‡ๆกฃ่ฏ†ๅˆซ๏ผŒๆŽ’้™คๆฐดๅฐ/้กต็œ‰้กต่„š๏ผŒๆ‰ซๆ/็”ŸๆˆไบŒ็ปด็ ใ€‚ๅ†…็ฝฎๅคšๅ›ฝ่ฏญ่จ€ๅบ“ใ€‚

๐Ÿ”— Repository URL: https://github.com/hiroi-sora/Umi-OCR

๐Ÿ“– Readme: https://github.com/hiroi-sora/Umi-OCR#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 36.7K stars
๐Ÿ‘€ Watchers: 186
๐Ÿด Forks: 3.6K forks

๐Ÿ’ป Programming Languages: Python - QML

๐Ÿท๏ธ Related Topics:
#screenshot #qt #ocr #qml #ocr_python #paddleocr #umi_ocr


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
๐Ÿ”ฅ Trending Repository: tesseract

๐Ÿ“ Description: Tesseract Open Source OCR Engine (main repository)

๐Ÿ”— Repository URL: https://github.com/tesseract-ocr/tesseract

๐ŸŒ Website: https://tesseract-ocr.github.io/

๐Ÿ“– Readme: https://github.com/tesseract-ocr/tesseract#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 69.4K stars
๐Ÿ‘€ Watchers: 1.7k
๐Ÿด Forks: 10.2K forks

๐Ÿ’ป Programming Languages: C++ - CMake - Java - Makefile - NSIS - C

๐Ÿท๏ธ Related Topics:
#machine_learning #ocr #tesseract #lstm #tesseract_ocr #hacktoberfest #ocr_engine


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
๐Ÿ”ฅ Trending Repository: PaddleOCR

๐Ÿ“ Description: Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

๐Ÿ”— Repository URL: https://github.com/PaddlePaddle/PaddleOCR

๐ŸŒ Website: https://www.paddleocr.ai

๐Ÿ“– Readme: https://github.com/PaddlePaddle/PaddleOCR#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 53.9K stars
๐Ÿ‘€ Watchers: 470
๐Ÿด Forks: 8.6K forks

๐Ÿ’ป Programming Languages: Python - C++ - Shell - Java - CMake - Cuda

๐Ÿท๏ธ Related Topics:
#ocr #db #kie #crnn #document_translation #ocrlite #chineseocr #pp_ocr #document_parsing #pp_structure #pdf2markdown #chatocr


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
๐Ÿ”ฅ Trending Repository: Dolphin

๐Ÿ“ Description: The official repo for โ€œDolphin: Document Image Parsing via Heterogeneous Anchor Promptingโ€, ACL, 2025.

๐Ÿ”— Repository URL: https://github.com/bytedance/Dolphin

๐Ÿ“– Readme: https://github.com/bytedance/Dolphin#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 6.3K stars
๐Ÿ‘€ Watchers: 53
๐Ÿด Forks: 516 forks

๐Ÿ’ป Programming Languages: Python - Shell

๐Ÿท๏ธ Related Topics:
#python #pdf #parser #ocr #pdf_converter #document_analysis #pdf_parser #layout_analysis #vlm_ocr


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
๐Ÿ”ฅ Trending Repository: siyuan

๐Ÿ“ Description: A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

๐Ÿ”— Repository URL: https://github.com/siyuan-note/siyuan

๐ŸŒ Website: https://b3log.org/siyuan

๐Ÿ“– Readme: https://github.com/siyuan-note/siyuan#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 37.6K stars
๐Ÿ‘€ Watchers: 159
๐Ÿด Forks: 2.3K forks

๐Ÿ’ป Programming Languages: TypeScript - Go - JavaScript - SCSS - HTML - CSS

๐Ÿท๏ธ Related Topics:
#electron #markdown #pdf #ocr #s3 #webdav #self_hosted #openai #note_taking #evernote #anki #knowledge_base #obsidian #notion #notes_app #local_first #chatgpt #ollama #deepseek


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
๐Ÿ”ฅ Trending Repository: LaTeX-OCR

๐Ÿ“ Description: pix2tex: Using a ViT to convert images of equations into LaTeX code.

๐Ÿ”— Repository URL: https://github.com/lukas-blecher/LaTeX-OCR

๐ŸŒ Website: https://lukas-blecher.github.io/LaTeX-OCR/

๐Ÿ“– Readme: https://github.com/lukas-blecher/LaTeX-OCR#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 15.4K stars
๐Ÿ‘€ Watchers: 85
๐Ÿด Forks: 1.2K forks

๐Ÿ’ป Programming Languages: Python - JavaScript - Jupyter Notebook

๐Ÿท๏ธ Related Topics:
#python #machine_learning #ocr #latex #deep_learning #image_processing #pytorch #dataset #transformer #vit #image2text #im2text #im2latex #im2markup #math_ocr #vision_transformer #latex_ocr


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
๐Ÿ”ฅ Trending Repository: MinerU

๐Ÿ“ Description: Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

๐Ÿ”— Repository URL: https://github.com/opendatalab/MinerU

๐ŸŒ Website: https://opendatalab.github.io/MinerU/

๐Ÿ“– Readme: https://github.com/opendatalab/MinerU#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 45.7K stars
๐Ÿ‘€ Watchers: 183
๐Ÿด Forks: 3.8K forks

๐Ÿ’ป Programming Languages: Python - Dockerfile

๐Ÿท๏ธ Related Topics:
#python #pdf #parser #ocr #pdf_converter #extract_data #document_analysis #pdf_parser #layout_analysis #ai4science #pdf_extractor_rag #pdf_extractor_llm #pdf_extractor_pretrain


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
โค1