Measuring and Benchmarking LLM Capabilities
Alexander Comerford
Date
September 24th, 2024
Time
6:30pm - 9:30pm
Location
The Hub at Office Logic
Event Description
π Join GPTuesdays and The GenAI Collective for another exciting event in our AI Speaker Series: "Measuring and Benchmarking LLM Capabilities." π₯ This session features Alexander Comerford, the second of two expert speakers, as he leads an in-depth exploration of key benchmarks that shape the performance of today's Large Language Models (LLMs): π What is a benchmark, and why does it matter for LLMs? π Deep dive into popular benchmarks like MMLU and MBPP π€ Agent benchmarks and their applications in AI systems π οΈ How to create your own LLM benchmarks tailored to your specific prompts β¨ Alex Comerford will take the stage as the second speaker, providing advanced insights on benchmarking principles and their practical applications in AI development, building on the foundational concepts introduced by the first speaker. π©βπ» Perfect for AI engineers, researchers, and enthusiasts, this session offers valuable insights into the evaluation frameworks that define success for modern AI systems. π Date: September 24th, 2024 π Time: 6:30pm to 9:30pm π Location: AI Center, Miami Dade College, Wolfson Campus, Building 2 πΊοΈ Event Venue: https://lnkd.in/g2NwDjE5 π FREE Parking (mention "AI Center Attendee"): https://lnkd.in/ghU8tr5k π Register Here (limited tickets!): https://lnkd.in/gQ8tv6Vv Donβt miss the chance to benchmark your LLM knowledge and connect with AI experts!