DeepSeek-Coder
DeepSeek Coder is composed of a series of code language models, each trained from scratch on 2T tokens, with a composition of 87% code and 13% natural language in both English and Chinese. We provide various sizes of the code model, ranging from 1B to 33B versions. Each model is pre-trained on project-level code corpus by employing a window size of 16K and an extra fill-in-the-blank task, to support project-level code completion and infilling. For coding capabilities, DeepSeek Coder achieves state-of-the-art performance among open-source code models on multiple programming languages and various benchmarks.
Creator: Deepseek-AI
Stars βοΈ: 15.6k
Forked by: 1.5k
Github Repo:
https://github.com/deepseek-ai/DeepSeek-Coder
@Machine_learn
DeepSeek Coder is composed of a series of code language models, each trained from scratch on 2T tokens, with a composition of 87% code and 13% natural language in both English and Chinese. We provide various sizes of the code model, ranging from 1B to 33B versions. Each model is pre-trained on project-level code corpus by employing a window size of 16K and an extra fill-in-the-blank task, to support project-level code completion and infilling. For coding capabilities, DeepSeek Coder achieves state-of-the-art performance among open-source code models on multiple programming languages and various benchmarks.
Creator: Deepseek-AI
Stars βοΈ: 15.6k
Forked by: 1.5k
Github Repo:
https://github.com/deepseek-ai/DeepSeek-Coder
@Machine_learn
GitHub
GitHub - deepseek-ai/DeepSeek-Coder: DeepSeek Coder: Let the Code Write Itself
DeepSeek Coder: Let the Code Write Itself. Contribute to deepseek-ai/DeepSeek-Coder development by creating an account on GitHub.
β€7π1
Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs
π₯ Github: https://github.com/reml-group/deliberation-on-priors
π Paper: https://arxiv.org/abs/2505.15210v1
@Machine_learn
π₯ Github: https://github.com/reml-group/deliberation-on-priors
π Paper: https://arxiv.org/abs/2505.15210v1
@Machine_learn
β€1
πAdvanced Applications of Machine Learning in Bioinformatics
πPublish year: 2025
π Study thesis
@Machine_learn
πPublish year: 2025
π Study thesis
@Machine_learn
β€3
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
24 Apr 2025 Β· Minju Seo, Jinheon Baek, Seongyun Lee, Sung Ju Hwang Β·
Paper: https://arxiv.org/pdf/2504.17192v2.pdf
Code: https://github.com/going-doer/paper2code
@Machine_learn
24 Apr 2025 Β· Minju Seo, Jinheon Baek, Seongyun Lee, Sung Ju Hwang Β·
Paper: https://arxiv.org/pdf/2504.17192v2.pdf
Code: https://github.com/going-doer/paper2code
@Machine_learn
π₯4β€2π1
TabSTAR: A Foundation Tabular Model With
Semantically Target-Aware Representations
π Paper
@Machine_learn
Semantically Target-Aware Representations
π Paper
@Machine_learn
β€1
COUNTING THE NUMBER OF Zp-AND Fp[t]-FIXED POINTS OF A DISCRETE DYNAMICAL
SYSTEM WITH APPLICATIONS FROM ARITHMETIC STATISTICS
π Read
@Machine_learn
SYSTEM WITH APPLICATIONS FROM ARITHMETIC STATISTICS
π Read
@Machine_learn
β€1
Article Title:
Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers
PDF Download Link:
https://arxiv.org/pdf/2504.19254v2.pdf
GitHub:
β’ https://github.com/cvs-health/uqlm
Datasets:
β’ GSM8K
β’ SVAMP
β’ PopQA
@Machine_learn
Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers
PDF Download Link:
https://arxiv.org/pdf/2504.19254v2.pdf
GitHub:
β’ https://github.com/cvs-health/uqlm
Datasets:
β’ GSM8K
β’ SVAMP
β’ PopQA
@Machine_learn
β€βπ₯1β€1
Good papers
Solving Video Inverse Problems Using Image Diffusion Models
Deep Random Features for Scalable Interpolation of Spatiotemporal Data
Century: A Framework and Dataset for Evaluating Ethical Contextualisation of Sensitive Images
No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
How much is a noisy image worth? Data Scaling Laws for Ambient Diffusion
A Decadeβs Battle on Dataset Bias: Are We There Yet?
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
@Machine_learn
Solving Video Inverse Problems Using Image Diffusion Models
Deep Random Features for Scalable Interpolation of Spatiotemporal Data
Century: A Framework and Dataset for Evaluating Ethical Contextualisation of Sensitive Images
No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
How much is a noisy image worth? Data Scaling Laws for Ambient Diffusion
A Decadeβs Battle on Dataset Bias: Are We There Yet?
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
@Machine_learn
arXiv.org
Solving Video Inverse Problems Using Image Diffusion Models
Recently, diffusion model-based inverse problem solvers (DIS) have emerged as state-of-the-art approaches for addressing inverse problems, including image super-resolution, deblurring, inpainting,...
β€9
Article Title:
s3: You Don't Need That Much Data to Train a Search Agent via RL
PDF Download Link:
https://arxiv.org/pdf/2505.14146v1.pdf
GitHub:
β’ https://github.com/pat-jj/s3
Datasets:
β’ Natural Questions
β’ TriviaQA
β’ HotpotQA
β’ MedQA
β’ PubMedQA
==================================
@Machine_learn
s3: You Don't Need That Much Data to Train a Search Agent via RL
PDF Download Link:
https://arxiv.org/pdf/2505.14146v1.pdf
GitHub:
β’ https://github.com/pat-jj/s3
Datasets:
β’ Natural Questions
β’ TriviaQA
β’ HotpotQA
β’ MedQA
β’ PubMedQA
==================================
@Machine_learn
β€3