DL in NLP links – Telegram

DL in NLP links

1.06K subscribers

5 photos

1 file

653 links

AI and DeepLearning news/articles links I use for @dlinnlp posts

DL in NLP links

https://x.com/spikedoanz/status/1831127711856935273?s=12&t=757tdnLa___vKX7ZeJax5A

4.07K views · 05:01

DL in NLP links

https://discuss.pytorch.org/t/distributed-w-torchtitan-introducing-async-tensor-parallelism-in-pytorch/209487

[Distributed w/ TorchTitan] Introducing Async Tensor Parallelism in PyTorch

with Horace He, Less Wright, Luca Wehrstedt, Tianyu Liu, Wanchao Liang. TL;DR: We implemented experimental async tensor parallelism support in PyTorch. We integrated it in TorchTitan and observed: up to ~29% forward pass speedup and ~8% E2E speedup in Llama3…

🔥2

4.21K views · 03:55

DL in NLP links

https://arxiv.org/abs/2409.12917

Training Language Models to Self-Correct via Reinforcement Learning

Self-correction is a highly desirable capability of large language models (LLMs), yet it has consistently been found to be largely ineffective in modern LLMs. Current methods for training...

4.4K views · 04:29

DL in NLP links

https://arxiv.org/pdf/2405.08007

4.72K views · 17:30

DL in NLP links

TL;DR: act dumb

4.53K views · 17:30

DL in NLP links

https://x.com/kellerjordan0/status/1842300916864844014?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

3.97K views · 15:38

DL in NLP links

https://x.com/stasbekman/status/1843483262129492200?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

4.12K views · 15:52

DL in NLP links

https://archive.is/2024.10.07-184310/https://www.theatlantic.com/technology/archive/2024/10/terence-tao-ai-interview/680153/

We’re Entering Uncharted Territory for Math - The Atlantic

archived 7 Oct 2024 18:43:10 UTC

❤1

4.89K views · 02:31

DL in NLP links

https://x.com/arankomatsuzaki/status/1844567821184872544?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

5.3K views · 04:21

DL in NLP links

https://x.com/pronounced_kyle/status/1845451573608186103

6.31K views · 18:36

DL in NLP links

https://arxiv.org/pdf/2406.14517

6.61K views · 15:45

DL in NLP links

https://x.com/yoavgo/status/1845835419264442772?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

👍1

7.29K views · 15:47

DL in NLP links

https://arxiv.org/abs/2410.05258

Differential Transformer

Transformer tends to overallocate attention to irrelevant context. In this work, we introduce Diff Transformer, which amplifies attention to the relevant context while canceling noise....

7.49K views · 03:24

DL in NLP links

https://x.com/lchoshen/status/1849060908242231329?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

7.23K views · 02:37

DL in NLP links

https://arxiv.org/abs/2410.01104

Softmax is not Enough (for Sharp Size Generalisation)

A key property of reasoning systems is the ability to make sharp decisions on their input data. For contemporary AI systems, a key carrier of sharp behaviour is the softmax function, with its...

👍1

8.06K views · 09:00

DL in NLP links

https://x.com/svlevine/status/1856924796996784244?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

7.88K views · 17:22

DL in NLP links

https://arxiv.org/abs/2412.01799

HPRM: High-Performance Robotic Middleware for Intelligent...

The rise of intelligent autonomous systems, especially in robotics and autonomous agents, has created a critical need for robust communication middleware that can ensure real-time processing of...

7.89K views · 17:41

DL in NLP links

https://x.com/thehumanoidhub/status/1868219800532771248?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

X (formerly Twitter)

The Humanoid Hub (@TheHumanoidHub) on X

Meta Motivo, an open-source behavioral foundation model designed to control virtual, physics-based humanoid agents.

It aims to significantly simplify the creation of general-purpose humanoid agents for robotics and virtual avatars.

Try the demo: https:…

❤1 👍1

8.41K views · 15:59

DL in NLP links

https://x.com/gargighosh/status/1873522368301408749?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

7.72K views · 15:11

DL in NLP links

https://r0bk.github.io/killedbyllm/

❤2

7.58K views · 20:42

DL in NLP links

https://www.kscale.dev/zbot

6.81K views · 18:16