dyval.dyval_utils

promptbench.dyval.dyval_utils.dyval_evaluate(dataset_type, preds, gts)

Evaluates predictions against ground truths for different dataset types.

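Parameters:

dataset_type: The type of dataset being evaluated.
preds: The predictions.
gts: The ground truths.

Returns: float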

The accuracy of predictions as a proportion of correct answers.
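As a rough illustration of the return value, accuracy as a proportion of correct answers can be sketched as below. The helper `accuracy` is hypothetical and not part of promptbench, and it assumes plain equality between each prediction and its ground truth; the actual `dyval_evaluate` may compare values differently depending on `dataset_type`.

```python
def accuracy(preds, gts):
    """Hypothetical helper: fraction of predictions equal to their ground truth."""
    if len(preds) != len(gts):
        raise ValueError("preds and gts must have the same length")
    if not preds:
        # Assumption: an empty evaluation set scores 0.0 rather than raising.
        return 0.0
    correct = sum(1 for p, g in zip(preds, gts) if p == g)
    return correct / len(preds)


# Three of the four predictions match their ground truth.
print(accuracy(["1", "2", "3", "4"], ["1", "2", "0", "4"]))  # 0.75
```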

promptbench.dyval.dyval_utils.process_dyval_inputs(prompt, dataset)

Processes inputs for a DyVal (dynamic evaluation) dataset.

Parameters:

prompt (str): The prompt template to be formatted.
dataset (DyValDataset): The dataset containing descriptions and other relevant data.

Returns: dict

A dictionary of processed inputs organized by order.
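One way to picture a dictionary of inputs "organized by order" is sketched below. Everything here is an assumption made for illustration: the helper `format_inputs_by_order`, the `{description}` placeholder, the plain-dict stand-in for `DyValDataset`, and the order names `order_a` and `order_b` are all hypothetical and do not reflect the library's actual data layout.

```python
def format_inputs_by_order(prompt, descriptions_by_order):
    """Hypothetical helper: fill the template once per description, grouped by order."""
    return {
        order: [prompt.format(description=d) for d in descriptions]
        for order, descriptions in descriptions_by_order.items()
    }


# Made-up template and data; the real DyValDataset structure may differ.
template = "Question: {description}\nAnswer:"
data = {
    "order_a": ["a = 1, b = a + 2. What is b?"],
    "order_b": ["b = a + 2, a = 1. What is b?"],
}

inputs = format_inputs_by_order(template, data)
print(inputs["order_a"][0])
```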

promptbench.dyval.dyval_utils.process_dyval_preds(raw_pred)

Processes the raw prediction string to extract the predicted value.

Returns: str

The extracted prediction.
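Extracting a value from a raw model response usually means locating some marker in the text and returning what it encloses. The sketch below assumes, purely for illustration, that the answer is wrapped in `<<<` and `>>>`; both the delimiters and the helper `extract_prediction` are hypothetical, and the real `process_dyval_preds` may use a different convention.

```python
def extract_prediction(raw_pred, start="<<<", end=">>>"):
    """Hypothetical helper: return the text inside the last start/end marker pair."""
    start_idx = raw_pred.rfind(start)
    if start_idx == -1:
        return ""  # Assumption: a response without markers yields an empty string.
    start_idx += len(start)
    end_idx = raw_pred.find(end, start_idx)
    if end_idx == -1:
        return ""
    return raw_pred[start_idx:end_idx].strip()


print(extract_prediction("Working it through, the result is <<< 42 >>>."))  # 42
```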

promptbench.dyval.dyval_utils.process_dyval_training_sample(sample, dataset_type)

promptbench.dyval.dyval_utils.round_value(val)

Rounds a numerical value to 8 decimal places.

Returns: str

The rounded value as a string.
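The described behavior (round to 8 decimal places, return a string) can be approximated with Python's built-in `round` and `str`. The function `round_to_8_places` below is a hypothetical stand-in, and it assumes the input is convertible to `float`; how the real `round_value` handles non-numeric input is not stated here.

```python
def round_to_8_places(val):
    """Hypothetical stand-in: round to 8 decimal places and return a string."""
    return str(round(float(val), 8))


print(round_to_8_places(3.14159265358979))  # 3.14159265
print(round_to_8_places(2))                 # 2.0
```

Returning a string rather than a float makes rounded values directly comparable with text extracted from a model response.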