UQLM provides a suite of response-level scorers for quantifying the uncertainty of Large Language Model (LLM) outputs. Each scorer returns a confidence score between 0 and 1, where higher scores ...
Abstract: Generating executable code from natural-language instructions with Large Language Models (LLMs) faces obstacles such as semantic uncertainty and the need for task-specific context ...
Abstract: Charge prediction is a critical task in judicial AI, involving the determination of criminal charges through detailed analysis of case narratives. Existing methods often face high ...