ML & AI resources

یه کتاب به نظر جامع برای یادگیری سریع Diffusion

خودم هنوز فرصت نکردم بخونم ولی به نظر به عنوان یه منبع تقریبا آکادمیک و کتاب‌طور، منبع مناسبیه

https://arxiv.org/pdf/2406.08929

❤3

208 viewsAmir 01, 08:42

ML & AI resources

سایت Scholar inbox به شما این قابلیت رو میده که personal digest داشته باشین؛ یعنی پیپرهای مرتبط به فیلدتون رو روزانه بهتون بده (مثل scholar alert ولی همراه با قابلیت‌های دیگه مثل مپ و .‌..)

https://arxiv.org/pdf/2504.08385v1

🔥2👍1

170 viewsSeyed Matin Tavakoli Afshari, 02:38

ML & AI resources

Flow matching in 4 mins

https://x.com/jbhuang0604/status/1950883022942978254?t=BsQv2hm_9VQGHNF0gQsK7A&s=35

155 viewsAmir 01, edited 14:13

ML & AI resources

From GPT-2 to gpt-oss: Analyzing the Architectural Advances
By: Sebastian Raschka

https://magazine.sebastianraschka.com/p/from-gpt-2-to-gpt-oss-analyzing-the

Sebastianraschka

From GPT-2 to gpt-oss: Analyzing the Architectural Advances

And How They Stack Up Against Qwen3

🔥2

134 viewsAli Asad, 13:06

ML & AI resources

Forwarded from Tensorflow(@CVision)

بالاخره صدای زبان فارسی هم شنیده شد!😳

مدل Whisper رو خیلی‌ها می‌شناسن؛ یکی از قوی‌ترین مدل‌ها برای تبدیل صدا به متنه.
اما یه مشکلی که داشت این بود که وقتی نوبت زبان فارسی می‌شد، دقتش پایین میومد و خیلی از کلمات رو درست نمتونست بنویسه.

اما حالا یه نسخه جدید به اسم Whisper-large-fa-v1 منتشر کرده که میتونه زبان فارسی رو به متن تبدیل کنه.
یه فرقی که این نسخه داره اینکه این نسخه روی یه دیتاست تازه به اسم Persian-Voice-v1 دوباره آموزش داده شده. دیتاستی که لهجه‌های مختلف فارسی و اصطلاحات خاص فارسی رو شامل میشه.

نتیجه چیشده؟

تشخیص و رونویسی گفتار فارسی خیلی دقیق‌تر شده.
این یعنی توی کاربردهایی مثل:

✅زیرنویس‌گذاری خودکار
✅ساخت دستیارهای صوتی
✅ابزارهای NLP فارسی

و مهم از همه اینکه این همه‌چی متن‌باز منتشر شده؛ یعنی هر پژوهشگر یا تیمی می‌تونه راحت استفاده کنه، تغییر بده و پروژه‌های جدید بسازه.

لینک مدل: https://huggingface.co/vhdm/whisper-large-fa-v1

لینک دیتاست: https://huggingface.co/datasets/vhdm/persian-voice-v1

منبع: https://www.linkedin.com/feed/update/urn:li:activity:7364194597717073925/

❤2

112 viewsAmir 01, 10:41

ML & AI resources

Diffusion models demystified, once and for all!

https://www.youtube.com/watch?v=Fk2I6pa6UeA&list=WL&index=19

YouTube

More Than Image Generators: A Science of Problem-Solving using Probability | Diffusion Models

This is my entry to #SoME4, 3Blue1Brown's Summer of Math Exposition Competition!

Diffusion models are typically portrayed as models that learn to denoise a corrupted image. This way, they can generate new images by gradually removing noise from a sample…

🔥4

150 viewsSeyed Matin Tavakoli Afshari, 23:41

ML & AI resources

https://siboehm.com/articles/22/CUDA-MMM

Siboehm

How to Optimize a CUDA Matmul Kernel for cuBLAS-like Performance: a Worklog

In this post, I’ll iteratively optimize an implementation of matrix multiplication written in CUDA.My goal is not to build a cuBLAS replacement, but to deepl...

123 viewsAmir 01, 09:41

ML & AI resources

https://youtu.be/i6l3535vRjA?si=4Ji4yw36d-nLX5YO

YouTube

AI Compression is 300x Better (but we don't use it)

It's crazy AI compression is still not the standard!
To learn for free on Brilliant, go to https://brilliant.org/GalLahat/ . You’ll also get 20% off an annual premium subscription.

Voice type with Peach Beta 🍑:
https://peach-voice.com

This video was sponsored…

🔥2

98 viewsSeyed Matin Tavakoli Afshari, 22:30

ML & AI resources

https://www.youtube.com/watch?v=R0uMcXsfo2o

YouTube

The physics behind diffusion models

Diffusion models build on the same mathematical framework as physical diffusion. In this video, we get to the core of the connection between the physics of motion and generative AI.

Topics covered:
• The intuition of probability landscapes (data as peaks…

🔥3

123 viewsAmirparsa, 23:33

ML & AI resources

https://x.com/keenanisalive/status/1964434335911858552?t=S1GUZLITap6cPKZeqtjDhg&s=35

X (formerly Twitter)

Keenan Crane (@keenanisalive) on X

“Everyone knows” what an autoencoder is… but there's an important complementary picture missing from most introductory material.

In short: we emphasize how autoencoders are implemented—but not always what they represent (and some of the implications of that…

❤2

106 viewsAmir 01, 11:21

ML & AI resources

https://mlhonk.substack.com/p/37-image-editing-with-step1x-edit

Substack

37. Step1X-Edit

How to build a dataset for text-guided image editing

113 viewsAmir 01, 00:04

ML & AI resources

https://www.youtube.com/watch?v=R0uMcXsfo2o

https://youtu.be/iv-5mZ_9CPY?si=8b8Hqrru0H-s2-fR

YouTube

But how do AI images and videos actually work? | Guest video by Welch Labs

Diffusion models, CLIP, and the math of turning text into images
Welch Labs Book: https://www.welchlabs.com/resources/imaginary-numbers-book

Sections
0:00 - Intro
3:37 - CLIP
6:25 - Shared Embedding Space
8:16 - Diffusion Models & DDPM
11:44 - Learning Vector…

🔥1

134 viewsAmir 01, 16:17

ML & AI resources

https://youtube.com/playlist?list=PL05umP7R6ij0hPfU7Yuz8J9WXjlb3MFjm&si=Fdvze07-mSMICJAB

YouTube

Probabilistic Machine Learning 2025 - Philipp Hennig

This is the course on Probabilistic Machine Learning in the Summer Term of 2025 at the University of Tübingen, taught by Professor Philipp Hennig. Probabilis...

82 viewsAmir 01, 06:58

ML & AI resources

Forwarded from DeepMind AI Expert (Farzad 🦅)

اندرو کارپثی گفته بود:
Can you take my 2h13m tokenizer video and translate [into] a book chapter.

We've done it! It includes prose, code & key images. It's a great way to learn this key piece of how LLMs work.
https://www.fast.ai/posts/2025-10-16-karpathy-tokenizers

https://solve.it

fast.ai

Let’s Build the GPT Tokenizer: A Complete Guide to Tokenization in LLMs – fast.ai

A text and code version of Karpathy’s famous tokenizer video.

29 viewsAmir 01, 13:12

About

Blog

Apps

Platform