Large-Scale Evaluation of Generative Shopping Assistants
Scalable evaluation framework to assess factual accuracy, safety, and quality across millions of LLM-generated shopping answers.
A browser-based agentic system that helps customers complete purchases across third-party websites using multimodal perception, reasoning, and tool-enabled actions.
Developed computer vision models to automatically quantify PD-L1 expression in immunohistochemistry slides.
We investigated the use of electrodermal activity (EDA), heart rate variability (HRV), and facial expression analysis as potential endpoints for determining quantitative pain scores.
The SNAPSHOT study seeks to measure Sleep, Networks, Affect, Performance, Stress, and Health using Objective Techniques.