UQLM provides a suite of response-level scorers for quantifying the uncertainty of Large Language Model (LLM) outputs. Each scorer returns a confidence score between 0 and 1, where higher scores ...
The code is validated on Python 3.10.14 + CUDA 11.8 + PyTorch 2.4.0. It should work with newer Python, CUDA, and PyTorch versions, but this is not guaranteed.

conda create -n ...