- This event has passed.
Teaching Computers to Read: Dataset Curation Impact on Model Performance

***Please Use Source Link Below to Confirm Event Details***
Workshop Summary: Successful AI solutions aren’t about chasing the newest model – it’s about solving the right problems in the right way. The book “Teaching Computers to Read” (out November 5 from CRC Press) focuses on what technical teams need to design, develop, deploy, and maintain useful NLP and AI solutions. Drawing on real-world experience and examples, the book offers actionable best practices to deliver adaptable, reliable AI systems that address business challenges with lasting, tangible value. In this tutorial, we will walk through one part of the Code Companion for the book. We will review the corpus distribution and variation, our annotated data distribution, and explore how our curated datasets impact the performance of different technical approaches, using information extraction as an example. The concepts covered in the tutorial are covered in more detail in the book, and there are additional exercises in the Code Companion for those interested in going beyond the tutorial session.
Original Event: Teaching Computers to Read: Dataset Curation Impact on Model Performance