-
Notifications
You must be signed in to change notification settings - Fork 135
/
test_cases.json
37 lines (37 loc) · 2.08 KB
/
test_cases.json
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
[
{
"name": "Arena Bench Hard",
"system_prompt": "",
"query": "Write a Python program to build an RL model to recite text from any position that the user provides, using only numpy."
},
{
"name": "Big Code Bench",
"system_prompt": "",
"query": "Calculates the average of the sums of absolute differences between each pair of consecutive numbers for all permutations of a given list. Each permutation is shuffled before calculating the differences. Args: - numbers (list): A list of numbers. Default is numbers from 1 to 10. The function should output with: float: The average of the sums of absolute differences for each shuffled permutation of the list. You should write self-contained code starting with: ``` import itertools from random import shuffle def task_func(numbers=list(range(1, 3))): ```"
},
{
"name": "Maths Problem",
"system_prompt": "You are a helpful AI assistant with expertise in mathematical and logical problem-solving.",
"query": "Maximize x + y subject to: x + 2y <= 10, x >= 0, y >= 0"
},
{
"name": "GSM8K",
"system_prompt": "",
"query": "If there are 3 cars in the parking lot and 2 more cars arrive, how many cars are in the parking lot?"
},
{
"name": "r/LocalLLaMA",
"system_prompt": "",
"query" : "I have a dish of potatoes. The following statements are true: No potatoes of mine, that are new, have >been boiled. All my potatoes in this dish are fit to eat. No unboiled potatoes of mine are fit to eat. Are there any new potatoes in this dish?"
},
{
"name" : "reddit",
"system_prompt": "",
"query" : "There are 24 volunteers. Over the next 3 weeks, each volunteer is assigned to a different task. There are 8 tasks. Each week, the volunteers switch tasks. Each task has 3 volunteers assigned to it. Volunteers cannot be assigned to the same task more than once, and volunteers cannot share the same task more than once."
},
{
"name" : "GH",
"system_prompt" : "",
"query" : "Find the largest possible real part of[(75+117i)z+\frac{96+144i}{z}]where z is a complex number with |z|=4"
}
]