GPTuesdays

Measuring and Benchmarking LLM Capabilities

Alexander Comerford

Machine Learning Engineer at NEAR.AI

Date

September 24th, 2024

Time

6:30pm - 9:30pm

Location

The Hub at Office Logic

Event Description

🎙 Join GPTuesdays and The GenAI Collective for another exciting event in our AI Speaker Series: "Measuring and Benchmarking LLM Capabilities."

💥 This session features Alexander Comerford, the second of two expert speakers, as he leads an in-depth exploration of key benchmarks that shape the performance of today's Large Language Models (LLMs):

📊 What is a benchmark, and why does it matter for LLMs?
🔍 Deep dive into popular benchmarks like MMLU and MBPP
🤖 Agent benchmarks and their applications in AI systems
🛠️ How to create your own LLM benchmarks tailored to your specific prompts

✨ Alex Comerford will take the stage as the second speaker, providing advanced insights on benchmarking principles and their practical applications in AI development, building on the foundational concepts introduced by the first speaker.

👩‍💻 Perfect for AI engineers, researchers, and enthusiasts, this session offers valuable insights into the evaluation frameworks that define success for modern AI systems.

📅 Date: September 24th, 2024
🕒 Time: 6:30pm to 9:30pm
📍 Location: AI Center, Miami Dade College, Wolfson Campus, Building 2
🗺️ Event Venue: https://lnkd.in/g2NwDjE5
🚗 FREE Parking (mention "AI Center Attendee"): https://lnkd.in/ghU8tr5k
🎟 Register Here (limited tickets!): https://lnkd.in/gQ8tv6Vv

Don’t miss the chance to benchmark your LLM knowledge and connect with AI experts!
