Preliminary results showing that OpenAI’s newest GPT-4 Turbo upgrade has totally failed to solve the “laziness” problem
OpenAI must be either totally incompetent at benchmarking, or total liars.
Leaning toward the latter.
“Overall, the new gpt-4-0125-preview model does worse on the lazy coding benchmark as compared to the November gpt-4-1106-preview model”
Lazy coding benchmark for gpt-4-0125-preview
Sam says NOW they really got rid of the laziness
Unlike the last 3 times they claimed to have gotten rid of it, but hadn’t, trust-me-bro.
People are skeptical
Princeton on GPT-4 for real-world coding: it generated a working solution only 1.7% of the time.
“We therefore introduce SWE-bench, an evaluation framework including 2,294 software engineering problems drawn from real GitHub issues and corresponding pull requests across 12 popular Python repositories. Given a codebase along with a description of an issue to be resolved, a language model is tasked with editing the codebase to address the issue.”
“Our evaluations show that both state-of-the-art proprietary models and our fine-tuned model SWE-Llama can resolve only the simplest issues. Claude 2 and GPT-4 solve a mere 4.8% and 1.7% of instances respectively, even when provided with an oracle retriever”
Arxiv Paper
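The quoted percentages are just pass rates over the 2,294 benchmark instances. A minimal sketch of that arithmetic, assuming one resolved/unresolved flag per instance (the helper below is illustrative, not SWE-bench’s actual evaluation harness; the 39-instance count is back-derived from the paper’s 1.7% figure):

```python
def resolution_rate(resolved_flags):
    """Percentage of benchmark instances whose generated patch resolved the issue."""
    if not resolved_flags:
        return 0.0
    return 100.0 * sum(resolved_flags) / len(resolved_flags)

# GPT-4 resolving ~39 of 2,294 instances gives the paper's 1.7% figure.
total = 2294
gpt4_flags = [True] * 39 + [False] * (total - 39)
print(f"{resolution_rate(gpt4_flags):.1f}%")  # → 1.7%
```

The point of the post stands either way: at these rates, roughly 98 out of every 100 real GitHub issues go unresolved.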
Google presents MusicRL: the first music generation system finetuned with RLHF (reinforcement learning from human feedback)
Paper
Project Page
Until 1 year ago, this vision of the AI future seemed like a joke
Not anymore. Thanks Sam.