14 subscribers
Pergi ke luar talian dengan aplikasi Player FM !
Podcast Berbaloi untuk Didengar
DITAJA


Automated Evaluation of LLMs
Manage episode 416961426 series 3370867
Anand Kannappan is the cofounder and CEO of Patronus AI, an automated AI evaluation and security company. They have raised funding from Lightspeed Venture Partners, Replit CEO Amjad Masad, Gokul Rajaram, and Fortune 500 executives. He was previously at Meta and Vertis. He was also the cofounder of Kyber Technologies, which was a service to systematically predict market events using AI and remote sensing data. It evolved into a futures quant hedge fund managing $15M for partners.
Anand's favorite book: Harry Potter series (Author: JK Rowling)
(00:00) Introduction and Common Failure Modes of Large Language Models
(03:02) Challenges of Automated Evaluation in AI Models
(06:08) The Importance of Fine-Tuning and Retrieval Augmented Generation
(09:02) Addressing Copyright Detection in Language Models
(11:51) The Liability of Companies Using AI Models
(15:02) Advancements in Multimodal Models and State Space Models
(20:48) The Role of Fine-Tuning in the Evolution of Language Models
(23:51) The Significance of Adversarial Testing in AI
(25:56) The Role of Retrieval Augmented Generation in AI
(28:05) The Need for Continuous Function Optimization in Prompting
(29:02) Rapid Fire Round
--------
Where to find Prateek Joshi:
Newsletter: https://prateekjoshi.substack.com
Website: https://prateekj.com
LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19
Twitter: https://twitter.com/prateekvjoshi
169 episod
Manage episode 416961426 series 3370867
Anand Kannappan is the cofounder and CEO of Patronus AI, an automated AI evaluation and security company. They have raised funding from Lightspeed Venture Partners, Replit CEO Amjad Masad, Gokul Rajaram, and Fortune 500 executives. He was previously at Meta and Vertis. He was also the cofounder of Kyber Technologies, which was a service to systematically predict market events using AI and remote sensing data. It evolved into a futures quant hedge fund managing $15M for partners.
Anand's favorite book: Harry Potter series (Author: JK Rowling)
(00:00) Introduction and Common Failure Modes of Large Language Models
(03:02) Challenges of Automated Evaluation in AI Models
(06:08) The Importance of Fine-Tuning and Retrieval Augmented Generation
(09:02) Addressing Copyright Detection in Language Models
(11:51) The Liability of Companies Using AI Models
(15:02) Advancements in Multimodal Models and State Space Models
(20:48) The Role of Fine-Tuning in the Evolution of Language Models
(23:51) The Significance of Adversarial Testing in AI
(25:56) The Role of Retrieval Augmented Generation in AI
(28:05) The Need for Continuous Function Optimization in Prompting
(29:02) Rapid Fire Round
--------
Where to find Prateek Joshi:
Newsletter: https://prateekjoshi.substack.com
Website: https://prateekj.com
LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19
Twitter: https://twitter.com/prateekvjoshi
169 episod
Semua episod
×
1 What it Takes to Build a BI Platform | Colin Zima, CEO of Omni 40:07

1 Building Billing Infrastructure for AI Companies | Alvaro Morales, CEO of Orb 38:21

1 Turning Legal Services to APIs | Jay Madheswaran, CEO of Eve 41:02

1 Is LLM the New Operating System? | Anant Bhardwaj, CEO of Instabase 45:37

1 Building AI Agents That Actually Work | Malte Kosub, CEO of Parloa 33:54

1 3000 Customers, One Bold Pivot: Building the First Generative AI Copilot for Lawyers | Scott Stevenson, CEO of Spellbook 44:07

1 The Outer Loop of AI-Powered Coding | Merrill Lutsky, CEO of Graphite 41:26

1 Behind the Scenes of AI Video | Amit Jain, founder of Luma AI 48:19

1 Building an AI-Powered Terminal | Zach Lloyd 38:06

1 When Robots Go Haywire, Who Picks Up The Tab? | Amias Gerety 48:54

1 Building MotherDuck to a $400M Company 49:18

1 AI Agents Have Brains, But Where Are Their Wallets? 47:27

1 Building Autonomous Greenhouses with AI and Robotics 37:45

1 Developing Battery Materials with AI 33:27

1 Digital Replicas That Can Have Real Conversations 37:40

1 Breaking New Ground With Collaborative Robots 49:22

1 How to extract intelligence from speech data with AI 44:56

1 The Long Tail of AI: Understanding and Resolving Edge Cases 37:53

1 How Symbolic AI is Transforming Critical Infrastructure 38:08

1 AI Disruption: Startups vs Incumbents in the Tech Stack 46:57

1 Unpacking AI Startups: Metrics, Playbooks, and the Future 33:09

1 AI's Role In Physics, Chemistry, and Beyond 39:27

1 Discovering New Materials With AI 39:35

1 Designing Printed Circuit Boards With AI 39:26

1 Modifying Speech Accents In Real Time With AI 34:34

1 MANG VC "Round Trip" Phenomenon in AI 40:38

1 Building and Investing in Consumer AI 40:51

1 Biosimulation for Drug Development 32:03
Selamat datang ke Player FM
Player FM mengimbas laman-laman web bagi podcast berkualiti tinggi untuk anda nikmati sekarang. Ia merupakan aplikasi podcast terbaik dan berfungsi untuk Android, iPhone, dan web. Daftar untuk melaraskan langganan merentasi peranti.