Mastering LLM Chatbot Testing: Metrics, Methods and Mistakes to Avoid | James Massa | #Testflix 2024

Mastering LLM Chatbot Testing: Metrics, Methods and Mistakes to Avoid | James Massa | #Testflix 2024

In this session, James Massa, Senior Executive Director of Software Engineering and Architecture at JPMorgan Chase, dives into the critical elements of mastering LLM (Large Language Model) chatbot testing. As AI-driven chatbots take center stage in user interactions, even the slightest flaw in testing can result in significant, high-profile failures. James will guide you through the cutting-edge techniques used to rigorously test these advanced systems, ensuring optimal performance while helping you avoid common pitfalls that could derail your chatbot’s success.

You’ll learn about the core architecture behind LLM chatbots and the components that need careful evaluation. James will also walk you through essential testing metrics, how to apply them effectively, and how to troubleshoot real-world issues like outdated knowledge graphs or fine-tuning errors. Through real case studies of chatbot failures, you’ll gain valuable insights into how proper testing could have prevented these issues. This session is a must for anyone involved in testing, engineering, or AI development, providing you with actionable strategies to future-proof your chatbot systems and stay ahead in the rapidly evolving AI landscape.

This video is of one of the Talks presented at #TestFlix - Biggest Virtual Software #Testing Conference, 2024.

#softwaretesting #automationtesting #testautomation #testflix2024 #testingchallenges

About Speaker:
James Massa is a frequent international conference speaker who presented two IEEE Blockchain security publications in Denmark in August. He is a serial innovator with 5 patents for AI, automated testing, FinOps and data quality. He won the 2024 FSTech award for leading the Best Financial Services IT Team. He is the repeat winner of the American Financial Technology Award for Best Compliance Initiative and the winner of the FF Banking Tech award for Best Reg Tech. James holds master’s degrees in digital design from Harvard University and in finance from Baruch.

Connect with James on LinkedIn -
https://www.linkedin.com/in/jamesmassa/

TestFlix 2024 Proud Sponsors:

BrowserStack - https://www.browserstack.com/
Element 34 - https://www.element34.com/
UiPath - https://www.uipath.com/
Avo Automation - https://avoautomation.ai/
Autify - https://autify.com/
Launchable - https://www.launchableinc.com/
GSPANN - https://www.gspann.com/
Katalon - https://katalon.com/
Reflect - https://reflect.run/
Yethi - https://yethi.in/
PractiTest - https://www.practitest.com/
TestGuild - https://testguild.com/
Functionize - https://www.functionize.com/

Learn from industry experts with Thrive.now courses, grow your network with software testers at The Test Tribe events, and become a member of Asia's largest testing community on Discord.

Upskill yourself with Thrive.now courses: https://bit.ly/thrivettt

Grow your network with software testers with the events at The Test Tribe: https://bit.ly/tttevents

Become a member of Asia's largest testing community: https://bit.ly/3FONxJP

About The Test Tribe:
The Test Tribe is a leading global Software Testing Community (proudly Asia’s Largest) turned EdTech Startup. Started in 2018 with a mission to give Testing Craft the glory it deserves while we co-create Smarter, Prouder, and Confident Testers. We take pride in creating unique global Events, Online Community spaces, and eLearning platforms where software testers collaborate, learn, and grow globally. With around 230+ Software Testing Events like Conferences, Hackathons, Meetups, Webinars, etc., and with other Community initiatives, we have reached a global footprint of over 80K+ Testers. We intend to be the top destination of choice for Testers across the globe for their upskilling and community needs.