SEE PRICING & PACKAGES

Monday, September 23, 2024 - 8:30am to 12:00pm

Evaluating and Testing Generative AI: Insights and Strategies

Generative AI (GenAI), exemplified by groundbreaking systems like ChatGPT and LLAMA, is revolutionizing the software landscape. These advanced technologies represent some of the most sophisticated software ever devised, capable of navigating an unprecedented range of prompts and questions, many of which have never been posed in human history. Their ability to generate varied responses to the same query and even fabricate answers when uncertain poses unique challenges in verification and testing. This talk delves into the intricacies of validating such systems and identifies areas needing enhancement. In stark contrast to traditional software with its well-defined inputs and outputs, GenAI operates on a different paradigm. Traditional software and even standard neural networks are meticulously designed by humans to yield specific results based on given inputs. GenAI, however, diverges significantly, being predominantly trained on vast text corpora. The surprising effectiveness of Large Language Models (LLMs), even to their creators, has sparked intense debates regarding the nature of GenAI – is it a genuine form of intelligence or consciousness, or merely a sophisticated pattern of statistical string outputs that we imbue with human-like qualities?

This workshop is crucial for a wide spectrum of attendees – from individuals who use ChatGPT casually, to engineers and managers engaged in developing software with LLMs or GenAI. Whether you are intrigued, enthusiastic, or concerned about the advancements in GenAI, this session is an essential platform to understand its complexities, challenges, and the ongoing discourse.

Pre-Requirement: Access to ChatGPT4 (or equivalent system).

Jason_Arbon
Testers.AI

Jason Arbon is a serial founder and AI practitioner focused on building and validating real products at scale. He previously founded Test.ai, an AI-first software testing company funded by Google, where he worked on applying machine learning to automate quality, relevance, and user experience validation across large, complex products. Today, he is the founder and CEO of Testers.AI, where he is advancing the next generation of AI-driven testing and evaluation—using AI agents and synthetic users to continuously test products, marketing, and user experiences in real-world conditions. Jason’s work is grounded in hands-on experience using AI to rapidly build, test, launch, and iterate businesses, and he brings a practical, execution-focused perspective on how AI can accelerate the entire lifecycle from idea to operating company.