promptbench
latest

Get Started

  • promptbench Introduction
  • Installation

Examples

  • Basic Usage
  • Multi-Modal Models
  • DyVal Evaluation
  • Prompt Attack
  • Prompt Engineering
  • Adding new modules

Reference

  • dataload
  • dyval
    • dyval.dyval_dataset
    • dyval.dyval_utils
    • dyval.DAG
  • metrics
  • models
  • utils

Leaderboards

  • Adversarial Prompt Leaderboard
  • Dynamic Evaluation Benchmark
  • Prompt Engineering Benchmark
promptbench
  • dyval
  • Edit on GitHub

dyvalΒΆ

  • dyval.dyval_dataset
    • DyValDataset
  • dyval.dyval_utils
    • dyval_evaluate()
    • process_dyval_inputs()
    • process_dyval_preds()
    • process_dyval_training_sample()
    • round_value()
  • dyval.DAG
    • dyval.DAG.code_dag
    • dyval.DAG.dag
    • dyval.DAG.describer
    • dyval.DAG.logic_dag
    • dyval.DAG.math_dag
Previous Next

© Copyright 2023, Kaijie Zhu. Revision 0403a12e.

Built with Sphinx using a theme provided by Read the Docs.