Data Science
Events
-
-
Designing Data Infrastructures for Multimodal Mobility Datasets
Online event***Please Use Source Link Below to Confirm Event Details*** This technical workshop focuses on the data infrastructure required to build and maintain production-grade mobility datasets at fleet scale. We will examine how to structure storage, metadata, access patterns, and quality controls so that mobility teams can treat perception datasets as first-class, versioned “infrastructure” assets. The session will walk through how to design a mobility data stack that connects object storage, labeling systems, simulation environments, and experiment tracking into a coherent, auditable pipeline. Original Event: Designing Data Infrastructures for Multimodal Mobility Datasets
Free -
Teaching Computers to Read: Dataset Curation Impact on Model Performance
Online event***Please Use Source Link Below to Confirm Event Details*** Workshop Summary: Successful AI solutions aren’t about chasing the newest model - it’s about solving the right problems in the right way. The book “Teaching Computers to Read” (out November 5 from CRC Press) focuses on what technical teams need to design, develop, deploy, and maintain useful NLP and AI solutions. Drawing on real-world experience and examples, the book offers actionable best practices to deliver adaptable, reliable AI systems that address business challenges with lasting, tangible value. In this tutorial, we will walk through one part of the Code Companion for the book. We will review the corpus distribution and variation, our annotated data distribution, and explore how our curated datasets impact the performance of different technical approaches, using information extraction as an example. The concepts covered in the tutorial are covered in more detail in the book, and there are additional exercises in the Code Companion for those interested in going beyond the tutorial session. Original Event: Teaching Computers to Read: Dataset Curation Impact on Model Performance
Free