Pareto AI Blog | Perspectives from the Frontier

Designing robust human studies for AI safety evaluations

A comprehensive guide to identifying vulnerabilities in AI models through systematic jailbreaking research, exploring methodologies, challenges, and potential defenses.

Ayush Parti

Dec 9, 2024

668

The ultimate guide to retrieval-augmented generation (RAG)

Retrieval-Augmented Generation (RAG) combines a retrieval component, which fetches relevant information from a database, with a generative model such as GPT, which produces text. It first retrieves documents relevant to a query, then passes them to the generator alongside the query to produce a response. Grounding the output in retrieved text helps RAG give accurate, diverse, and contextually appropriate responses, making it effective for tasks like question answering and content generation.

Ayush Parti

Apr 29, 2024

1522

Debating with More Persuasive LLMs Leads to More Truthful Answers

How structured LLM debate helps weaker models and humans evaluate stronger ones more accurately

Feb 9, 2024

170

Reinforcement learning from human feedback: everything you need to know

In this post, we explain what RLHF is, where it is applied in machine learning, and how it lets agents optimize their decisions through human-guided interactions, improving their performance and real-world relevance.

Ayush Parti

Nov 8, 2023

2275

Perspectives from the frontier

Designing robust human studies for AI safety evaluations

The ultimate guide to retrieval-augmented generation (RAG)

Debating with More Persuasive LLMs Leads to More Truthful Answers

Reinforcement learning from human feedback: everything you need to know