| Model | AGIEval | GPT4All | TruthfulQA | Bigbench | Average |
|---|---|---|---|---|---|
| LLAMA_Harsha_8_B_ORDP_10k | 35.54 | 71.15 | 55.39 | 37.96 | 50.01 |
| Task | Version | Metric | Value | Stderr | |
|---|---|---|---|---|---|
| agieval_aqua_rat | 0 | acc | 26.77 | ± | 2.78 |
| acc_norm | 27.17 | ± | 2.80 | ||
| agieval_logiqa_en | 0 | acc | 31.34 | ± | 1.82 |
| Model | AGIEval | GPT4All | TruthfulQA | Bigbench | Average |
|---|---|---|---|---|---|
| LLAMA_Harsha_8_B_ORDP_10k | 35.54 | 71.15 | 55.39 | 37.96 | 50.01 |
| Task | Version | Metric | Value | Stderr | |
|---|---|---|---|---|---|
| agieval_aqua_rat | 0 | acc | 26.77 | ± | 2.78 |
| acc_norm | 27.17 | ± | 2.80 | ||
| agieval_logiqa_en | 0 | acc | 31.34 | ± | 1.82 |
| Model | AGIEval | GPT4All | TruthfulQA | Bigbench |
|---|---|---|---|---|
| Anthropic_RLFH_ORDP_40k | 30.55 | Error: File does not exist | 45.38 | 36.75 |
| Task | Version | Metric | Value | Stderr | |
|---|---|---|---|---|---|
| agieval_aqua_rat | 0 | acc | 21.26 | ± | 2.57 |
| acc_norm | 22.83 | ± | 2.64 | ||
| agieval_logiqa_en | 0 | acc | 28.11 | ± | 1.76 |
| #!/usr/bin/env python | |
| # -*- coding: utf-8 -*- | |
| """ | |
| Remove output from existing Jupyter Notebooks. | |
| Modified from remove_output by Minrk, damianavila, gabraganca. | |
| References: | |
| [0]: https://github.com/jupyter/nbformat | |
| [1]: http://nbformat.readthedocs.org/en/latest/index.html |