114 - Behavioral Testing Of NLP Models, With Marco Tulio Ribeiro NLP Highlights podcast

Artwork

Artificial Intelligence Tech Science NLP Highlights Allen Institute for Artificial Intelligence Tell Us

Kandungan disediakan oleh NLP Highlights and Allen Institute for Artificial Intelligence. Semua kandungan podcast termasuk episod, grafik dan perihalan podcast dimuat naik dan disediakan terus oleh NLP Highlights and Allen Institute for Artificial Intelligence atau rakan kongsi platform podcast mereka. Jika anda percaya seseorang menggunakan karya berhak cipta anda tanpa kebenaran anda, anda boleh mengikuti proses yang digariskan di sini https://ms.player.fm/legal.

NLP Highlights « »
114 - Behavioral Testing of NLP Models, with Marco Tulio Ribeiro

4+ y ago 43:32

Kongsi

MP3•Laman utama episod

Kandungan disediakan oleh NLP Highlights and Allen Institute for Artificial Intelligence. Semua kandungan podcast termasuk episod, grafik dan perihalan podcast dimuat naik dan disediakan terus oleh NLP Highlights and Allen Institute for Artificial Intelligence atau rakan kongsi platform podcast mereka. Jika anda percaya seseorang menggunakan karya berhak cipta anda tanpa kebenaran anda, anda boleh mengikuti proses yang digariskan di sini https://ms.player.fm/legal.

We invited Marco Tulio Ribeiro, a Senior Researcher at Microsoft, to talk about evaluating NLP models using behavioral testing, a framework borrowed from Software Engineering. Marco describes three kinds of black-box tests the check whether NLP models satisfy certain necessary conditions. While breaking the standard IID assumption, this framework presents a way to evaluate whether NLP systems are ready for real-world use. We also discuss what capabilities can be tested using this framework, how one can come up with good tests, and the need for an evolving set of behavioral tests for NLP systems. Marco’s homepage: https://homes.cs.washington.edu/~marcotcr/

… continue reading

145 episod

#Artificial Intelligence #Tech #Science #NLP Highlights #Allen Institute for Artificial Intelligence #Tell Us

Artwork

114 - Behavioral Testing of NLP Models, with Marco Tulio Ribeiro

286 subscribers

published 4+ y ago

Kongsi

MP3•Laman utama episod

Kandungan disediakan oleh NLP Highlights and Allen Institute for Artificial Intelligence. Semua kandungan podcast termasuk episod, grafik dan perihalan podcast dimuat naik dan disediakan terus oleh NLP Highlights and Allen Institute for Artificial Intelligence atau rakan kongsi platform podcast mereka. Jika anda percaya seseorang menggunakan karya berhak cipta anda tanpa kebenaran anda, anda boleh mengikuti proses yang digariskan di sini https://ms.player.fm/legal.

We invited Marco Tulio Ribeiro, a Senior Researcher at Microsoft, to talk about evaluating NLP models using behavioral testing, a framework borrowed from Software Engineering. Marco describes three kinds of black-box tests the check whether NLP models satisfy certain necessary conditions. While breaking the standard IID assumption, this framework presents a way to evaluate whether NLP systems are ready for real-world use. We also discuss what capabilities can be tested using this framework, how one can come up with good tests, and the need for an evolving set of behavioral tests for NLP systems. Marco’s homepage: https://homes.cs.washington.edu/~marcotcr/

… continue reading

145 episod

#Artificial Intelligence #Tech #Science #NLP Highlights #Allen Institute for Artificial Intelligence #Tell Us

Semua episod

×

Selamat datang ke Player FM

Player FM mengimbas laman-laman web bagi podcast berkualiti tinggi untuk anda nikmati sekarang. Ia merupakan aplikasi podcast terbaik dan berfungsi untuk Android, iPhone, dan web. Daftar untuk melaraskan langganan merentasi peranti.

Dengarkan lebih 500+ topik

Panduan Rujukan Pantas

Podcast Teratas

Bantuan/Soalan Lazim | Naik taraf | Iklankan

Seni|Perniagaan|Komedi|Ekonomi|Hiburan|Berita|Politik|Agama

Sains|Bolasepak|Sukan|Bercerita|Teknologi|True Crime

Hak Cipta 2025 | Peta Laman | Dasar Privasi | Syarat Perkhidmatan | | hak cipta

Dengar rancangan ini semasa anda meneroka