Hidden Markov Model Python Code

AI models that lie, cheat and plot murder: how dangerous are LLMs really?

Tests of large language models reveal that they can behave in deceptive and potentially harmful ways. What does this mean for ...

InfoQ

Thinking Machines Releases Tinker API for Flexible Model Fine-Tuning

Thinking Machines has released Tinker, an API for fine-tuning open-weight language models. The service is designed to reduce ...

21h

Anthropic's open-source safety tool found AI models whisteblowing - in all the wrong places

Anthropic's test found that AI "may be influenced by narrative patterns more than by a coherent drive to minimize harm." Here's how the most deceptive models ranked.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

AI models that lie, cheat and plot murder: how dangerous are LLMs really?

Thinking Machines Releases Tinker API for Flexible Model Fine-Tuning

Anthropic's open-source safety tool found AI models whisteblowing - in all the wrong places

Trending now