DL in NLP links
@dlinnlp_links
1.06K subscribers
AI and DeepLearning news/articles links I use for @dlinnlp posts
DL in NLP links
https://x.com/spikedoanz/status/1831127711856935273?s=12&t=757tdnLa___vKX7ZeJax5A
DL in NLP links
https://discuss.pytorch.org/t/distributed-w-torchtitan-introducing-async-tensor-parallelism-in-pytorch/209487
PyTorch Forums
[Distributed w/ TorchTitan] Introducing Async Tensor Parallelism in PyTorch
with Horace He, Less Wright, Luca Wehrstedt, Tianyu Liu, Wanchao Liang. TL;DR: We implemented experimental async tensor parallelism support in PyTorch. We integrated it in TorchTitan and observed: Up to ~29% forward pass speedup and ~8% E2E speedup in Llama3…
DL in NLP links
https://arxiv.org/abs/2409.12917
arXiv.org
Training Language Models to Self-Correct via Reinforcement Learning
Self-correction is a highly desirable capability of large language models (LLMs), yet it has consistently been found to be largely ineffective in modern LLMs. Current methods for training...
DL in NLP links
https://arxiv.org/pdf/2405.08007
DL in NLP links
TL;DR: act dumb
DL in NLP links
https://x.com/kellerjordan0/status/1842300916864844014?s=12&t=QgBLS4SmhE8cqdYBmhrqJA
DL in NLP links
https://x.com/stasbekman/status/1843483262129492200?s=12&t=QgBLS4SmhE8cqdYBmhrqJA
DL in NLP links
https://archive.is/2024.10.07-184310/https://www.theatlantic.com/technology/archive/2024/10/terence-tao-ai-interview/680153/
archive.is
We’re Entering Uncharted Territory for Math - The Atlantic
archived 7 Oct 2024 18:43:10 UTC
DL in NLP links
https://x.com/arankomatsuzaki/status/1844567821184872544?s=12&t=QgBLS4SmhE8cqdYBmhrqJA
DL in NLP links
https://x.com/pronounced_kyle/status/1845451573608186103
DL in NLP links
https://arxiv.org/pdf/2406.14517
DL in NLP links
https://x.com/yoavgo/status/1845835419264442772?s=12&t=QgBLS4SmhE8cqdYBmhrqJA
DL in NLP links
https://arxiv.org/abs/2410.05258
arXiv.org
Differential Transformer
Transformer tends to overallocate attention to irrelevant context. In this work, we introduce Diff Transformer, which amplifies attention to the relevant context while canceling noise....
DL in NLP links
https://x.com/lchoshen/status/1849060908242231329?s=12&t=QgBLS4SmhE8cqdYBmhrqJA
DL in NLP links
https://arxiv.org/abs/2410.01104
arXiv.org
Softmax is not Enough (for Sharp Size Generalisation)
A key property of reasoning systems is the ability to make sharp decisions on their input data. For contemporary AI systems, a key carrier of sharp behaviour is the softmax function, with its...
DL in NLP links
https://x.com/svlevine/status/1856924796996784244?s=12&t=QgBLS4SmhE8cqdYBmhrqJA
DL in NLP links
https://arxiv.org/abs/2412.01799
arXiv.org
HPRM: High-Performance Robotic Middleware for Intelligent...
The rise of intelligent autonomous systems, especially in robotics and autonomous agents, has created a critical need for robust communication middleware that can ensure real-time processing of...
DL in NLP links
https://x.com/thehumanoidhub/status/1868219800532771248?s=12&t=QgBLS4SmhE8cqdYBmhrqJA
X (formerly Twitter)
The Humanoid Hub (@TheHumanoidHub) on X
Meta Motivo, an open-source behavioral foundation model designed to control virtual, physics-based humanoid agents.
It aims to significantly simplify the creation of general-purpose humanoid agents for robotics and virtual avatars.
Try the demo: https:…
DL in NLP links
https://x.com/gargighosh/status/1873522368301408749?s=12&t=QgBLS4SmhE8cqdYBmhrqJA
DL in NLP links
https://r0bk.github.io/killedbyllm/
DL in NLP links
https://www.kscale.dev/zbot