Machine learning books and papers
Admin: @Raminmousa
WhatsApp: +989333900804
ID: @Machine_learn
link: https://t.iss.one/Machine_learn
DeepSeek-Coder

DeepSeek Coder is a series of code language models, each trained from scratch on 2T tokens made up of 87% code and 13% natural language in English and Chinese. The models come in several sizes, ranging from 1B to 33B. Each model is pre-trained on a project-level code corpus with a window size of 16K and an additional fill-in-the-blank task, which supports project-level code completion and infilling. DeepSeek Coder achieves state-of-the-art performance among open-source code models across multiple programming languages and benchmarks.
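To make the fill-in-the-blank (infilling) idea concrete, here is a minimal sketch of how an infilling prompt is commonly assembled: the code before and after the gap is wrapped in sentinel markers, and the model is asked to produce the missing middle. The sentinel strings, the prefix-hole-suffix ordering, and both helper names are illustrative assumptions and not taken from the DeepSeek Coder repository; a real model defines its own special tokens in its tokenizer.

```python
# Hypothetical placeholder sentinels (assumption): a real model ships
# its own special tokens, so these strings are for illustration only.
FIM_BEGIN = "<FIM_BEGIN>"
FIM_HOLE = "<FIM_HOLE>"
FIM_END = "<FIM_END>"


def build_infill_prompt(prefix: str, suffix: str,
                        begin: str = FIM_BEGIN,
                        hole: str = FIM_HOLE,
                        end: str = FIM_END) -> str:
    """Wrap the code before and after a gap in sentinel markers.

    The model is expected to generate the text that belongs at `hole`.
    """
    return f"{begin}{prefix}{hole}{suffix}{end}"


def splice_completion(prefix: str, completion: str, suffix: str) -> str:
    """Reassemble the file once the missing middle is known."""
    return prefix + completion + suffix


prefix = "def add(a, b):\n    "
suffix = "\n\nprint(add(2, 3))\n"
prompt = build_infill_prompt(prefix, suffix)

# Stand-in for a model call: the completion is written by hand here.
completion = "return a + b"
filled = splice_completion(prefix, completion, suffix)
```

In practice the prompt would be tokenized and passed to the model, and the generated text would take the place of the hand-written `completion`.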

Creator: DeepSeek-AI
Stars ⭐️: 15.6k
Forked by: 1.5k

GitHub repo:
https://github.com/deepseek-ai/DeepSeek-Coder

@Machine_learn
Full PyTorch Implementation of Compressive Transformer

📚 Link


@Machine_learn
πŸ‘2
probability_cheatsheet.pdf
789.3 KB
Probability Cheatsheet
@Machine_learn
Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs

🖥 GitHub: https://github.com/reml-group/deliberation-on-priors

📕 Paper: https://arxiv.org/abs/2505.15210v1

@Machine_learn
Reinforcement Learning: An Overview

📚 Book


@Machine_learn
The 2025 AI Index Report

📚 Read

@Machine_learn
πŸ‘3
🎓 Advanced Applications of Machine Learning in Bioinformatics

🗓 Publication year: 2025

📎 Study thesis


@Machine_learn
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

24 Apr 2025 · Minju Seo, Jinheon Baek, Seongyun Lee, Sung Ju Hwang



Paper: https://arxiv.org/pdf/2504.17192v2.pdf

Code: https://github.com/going-doer/paper2code

@Machine_learn
The Way of Code: The Timeless Art of Vibe Coding

📚 Link


@Machine_learn
πŸ‘2❀1
EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video

📚 Paper

@Machine_learn
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

📚 Paper

@Machine_learn
System Card: Claude Opus 4 & Claude Sonnet 4

📚 Book


@Machine_learn
MuLoCo: Muon is a practical inner optimizer for DiLoCo

📚 Read

@Machine_learn
A Tutorial on Meta-Reinforcement Learning

📚 Read

@Machine_learn
COUNTING THE NUMBER OF Zp- AND Fp[t]-FIXED POINTS OF A DISCRETE DYNAMICAL SYSTEM WITH APPLICATIONS FROM ARITHMETIC STATISTICS

📚 Read


@Machine_learn
Forwarded from Github LLMs
Qwen 3 release

📖 Blog


@LLM_learning
Article Title:
Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers


PDF Download Link:
https://arxiv.org/pdf/2504.19254v2.pdf

GitHub:
• https://github.com/cvs-health/uqlm

Datasets:
• GSM8K
• SVAMP
• PopQA
@Machine_learn
Forecasting: Principles and Practice

📚 Book

@Machine_learn
Article Title:
s3: You Don't Need That Much Data to Train a Search Agent via RL





PDF Download Link:
https://arxiv.org/pdf/2505.14146v1.pdf

GitHub:
• https://github.com/pat-jj/s3

Datasets:
• Natural Questions
• TriviaQA
• HotpotQA
• MedQA
• PubMedQA
==================================
@Machine_learn