Today, we're scaling up our open-source environments program to become the global hub for open evals and RL environments.
As part of this, we're committing hundreds of thousands of $ in grants and looking for partners who want to join our mission to accelerate open superintelligence.
In the past 2 months, we've crowdsourced 400+ frontier RL environments & evals through bounties and our RL residency, which had over 500+ applicants.
Through our new large-scale bounty program we will scale this to thousands of open-source environments.
Over the past few weeks, the community crowdsourced environments across:
Autonomous AI Research
Frontier Evals
Browser Automation
Theorem Proving
Subject-Specific QA
Legal/Finance Tasks
and many more domains...
Thank you to all the community members who've contributed an environment or joined the residency so far!
We’ve learned a lot over the past 2 months from our Bounties and RL Residency program — scaling past 400+ environments, with 80+ reviewed implementations and 500+ applications for the residency.
Today, we’re committing hundreds of thousands of $ as grants to sponsor builders around the world to help us grow the Environments Hub into the premier platform for open evals and RL environments.
Many of these grants will be in the form of bounties for completing specific environment implementations, such as popular evaluation suites, paper reproductions, or ecosystem integrations.
Prime Intellect Bounty Program - Environments, Evals, and Open Superintelligence
To scale the Environments Hub while maintaining high quality, the bounty program is split into two categories: Open Access and Application-Only Tasks.
Open Access tasks can be attempted by anyone, are intended for first-time builders within the Environments Hub ecosystem, and have smaller bounties in the $100-500 range.
Application-Only tasks are more complex in scope, and have larger bounties ranging from $1000 to $5000+. They are intended for those who are already comfortable building environments/evals, and will be assigned on an approval basis to builders who apply via this form: https://form.typeform.com/to/jLfT7v7o
Bounties are awarded upon successful completion of the task. For Application-Only tasks involving reimplementation of established benchmarks, this will entail full reproduction of known evaluation scores for a relevant subset of models.
We are also opening a Call for Partners for organizations who are eager to sponsor bounties via this program, or who are otherwise interested in participating (e.g. volunteering reviewer time, donating API credits, suggesting tasks for implementation, formalizing validation criteria, etc).
Sourcing environments for knowledge or skills in a particular domain of interest (e.g. medicine, materials science, systems programming)
Companies interested in sponsoring environments & evals built around their tools
Companies or organizations who are training their own models and are seeking custom RL environments.
Companies who are creating their own evals and interested in spotlighting work samples on the Environments Hub
Academic researchers who are creating their own novel benchmarks -- we are happy to sponsor evaluation costs for benchmarks whose reference implementation is built with verifiers and released on the Environments Hub.
Bounties sponsored by third-party organizations typically will fall within the Application-Only category, and we will assist in matching your task with capable and compatible builders from our community.
Next Steps
Our goal is to establish the Environments Hub as the global center of gravity for building, sharing and accessing RL environments & evals.
We’re excited to scale the scope, complexity and usefulness of environments on the hub to enable the open-source ecosystem to train increasingly competitive and useful models.
We’ve already been training and evaluating INTELLECT-3 on environments & evals contributed by the community over the last months to scale RL post-training by orders of magnitude of our previous efforts.
Let's open-source the tools to build superintelligence.
Eurus-2-PRIME
System Prompt
When tackling complex reasoning tasks, you have access to the following actions. Use them as needed to progress through your thought process. [ASSESS] [ADVANCE] [VERIFY] [SIMPLIFY] [SYNTHESIZE] [PIVOT] [OUTPUT] You should strictly follow the format below: [ACTION NAME] # Your action step 1 # Your action step 2 # Your action step 3 ... Next action: [NEXT ACTION NAME]
User
Quadratic polynomials $P(x)$ and $Q(x)$ have leading coefficients $2$ and $-2,$ respectively. The graphs of both polynomials pass through the two points $(16,54)$ and $(20,53).$ Find $P(0) + Q(0).$
Eurus-2-PRIME
[ASSESS]
We are given two quadratic polynomials $P(x)$ and $Q(x)$ with leading coefficients $2$ and $-2$, respectively. The graphs of both polynomials pass through the points $(16,54)$ and $(20,53)$. We need to find the value of $P(0) + Q(0)$.
[ADVANCE]
Let's express the quadratic polynomials $P(x)$ and $Q(x)$ in the general form:
$ P(x) = 2x^2 + bx + c $
$ Q(x) = -2x^2 + dx + e $
Since the graphs of both polynomials pass through the points $(16,54)$ and $(20,53)$, we can substitute these points into the equations of $P(x)$ and $Q(x)$ to form a system of equations.
For $P(x)$:
$ P(16) = 2(16)^2 + 16b + c = 54 $
$ 512 + 16b + c = 54 $
$ 16b + c = -458 \quad \text{(Equation 1)} $
$ P(20) = 2(20)^2 + 20b + c = 53 $
$ 800 + 20b + c = 53 $
$ 20b + c = -747 \quad \text{(Equation 2)} $
For $Q(x)$:
$ Q(16) = -2(16)^2 + 16d + e = 54 $
$ -512 + 16d + e = 54 $
$ 16d + e = 566 \quad \text{(Equation 3)} $
$ Q(20) = -2(20)^2 + 20d + e = 53 $
$ -800 + 20d + e = 53 $
$ 20d + e = 853 \quad \text{(Equation 4)} $
[ADVANCE]
Now, we can solve the system of equations for $P(x)$: Subtract Equation 1 from Equation 2:
$ (20b + c) - (16b + c) = -747 - (-458) $
$ 4b = -289 $
$ b = -\frac{289}{4} $
Substitute $b = -\frac{289}{4}$ into Equation 1:
$ 16\left(-\frac{289}{4}\right) + c = -458 $
$ -1156 + c = -458 $
$ c = 698 $
So, for $P(x)$, we have:
$ P(x) = 2x^2 - \frac{289}{4}x + 698 $
$ P(0) = 698 $
Next, we can solve the system of equations for $Q(x)$: Subtract Equation 3 from Equation 4:
$ (20d + e) - (16d + e) = 853 - 566 $
$ 4d = 287 $
$ d = \frac{287}{4} $
Substitute $d = \frac{287}{4}$ into Equation 3:
$ 16\left(\frac{287}{4}\right) + e = 566 $
$ 1148 + e = 566 $
$ e = -582 $
So, for $Q(x)$, we have:
$ Q(x) = -2x^2 + \frac{287}{4}x - 582 $
$ Q(0) = -582 $
[ADVANCE]
Finally, we can find $P(0) + Q(0)$:
$ P(0) + Q(0) = 698 + (-582) = 116 $
So, the value of $P(0) + Q(0)$ is $\boxed{116}$.
INTELLECT-MATH-7B-SFT (ours)
System Prompt
Solve the following math problem efficiently and clearly. Think carefully and step by step about your response and reason before providing a final response. Conclude your response with: \n\nTherefore, the final answer is: $\\boxed{answer}$. I hope it is correct.\n\nWhere [answer] is just the final number or expression that solves the problem.
User
Quadratic polynomials $P(x)$ and $Q(x)$ have leading coefficients $2$ and $-2,$ respectively. The graphs of both polynomials pass through the two points $(16,54)$ and $(20,53).$ Find $P(0) + Q(0).$
INTELLECT-MATH-7B-SFT
So I've got this problem here about quadratic polynomials $P(x)$ and $Q(x)$. $P(x)$ has a leading coefficient of $2$, and $Q(x)$ has a leading coefficient of $-2$. Both polynomials pass through the points $(16,54)$ and $(20,53)$. I need to find $P(0) + Q(0)$.
First, since both $P(x)$ and $Q(x)$ are quadratic polynomials, I can express them in the general form:
$P(x) = 2x^2 + bx + c$
$Q(x) = -2x^2 + dx + e$
Wait, actually, the leading coefficient is the coefficient of the highest power of $x$, which for a quadratic is $x^2$. So yes, $P(x) = 2x^2 + bx + c$ and $Q(x) = -2x^2 + dx + e$.
Now, both polynomials pass through the points $(16,54)$ and $(20,53)$. That means when $x=16$, $P(16)=54$, and when $x=20$, $P(20)=53$. Similarly for $Q(x)$.
So, I can set up equations using these points.
First, for $P(x)$:
$P(16) = 2(16)^2 + b16 + c = 54$
Similarly, $P(20) = 2(20)^2 + b20 + c = 53$
Let me calculate $16^2$ and $20^2$:
$16^2 = 256$
$20^2 = 400$
So,
$P(16): 2256 + 16b + c = 54 \Rightarrow 512 + 16b + c = 54 \Rightarrow 16b + c = 54 - 512 \Rightarrow 16b + c = -458$
$P(20): 2400 + 20b + c = 53 \Rightarrow 800 + 20b + c = 53 \Rightarrow 20b + c = 53 - 800 \Rightarrow 20b + c = -747$
Now, I have two equations:
1) $16b + c = -458$
2) $20b + c = -747$
I can subtract equation 1 from equation 2 to eliminate $c$:
$(20b + c) - (16b + c) = -747 - (-458)$
$20b + c - 16b - c = -747 + 458$
$4b = -289$
So, $b = -289 / 4 = -72.25$
Wait, but coefficients are usually integers, but maybe not. Let me check.
Now, plug $b$ back into one of the equations to find $c$. Let's use equation 1:
$16(-72.25) + c = -458$
$-1156 + c = -458$
$c = -458 + 1156 = 698$
So, $P(x) = 2x^2 - 72.25x + 698$
Hmm, $P(0)$ would be $c$, which is $698$.
Now, let's do the same for $Q(x)$.
$Q(16) = -2*(16)^2 + d16 + e = 54 \Rightarrow -2256 + 16d + e = 54 \Rightarrow -512 + 16d + e = 54 \Rightarrow 16d + e = 54 + 512 \Rightarrow 16d + e = 566$
$Q(20) = -2*(20)^2 + d20 + e = 53 \Rightarrow -2400 + 20d + e = 53 \Rightarrow -800 + 20d + e = 53 \Rightarrow 20d + e = 53 + 800 \Rightarrow 20d + e = 853$
Now, equations for $Q(x)$:
1) $16d + e = 566$
2) $20d + e = 853$
Subtract equation 1 from equation 2:
$(20d + e) - (16d + e) = 853 - 566$
$4d = 287$
$d = 287 / 4 = 71.75$
Now, plug $d$ back into equation 1:
$1671.75 + e = 566 \Rightarrow 1148 + e = 566 \Rightarrow e = 566 - 1148 = -582$
So, $Q(x) = -2x^2 + 71.75x - 582$
Therefore, $Q(0) = e = -582$
Now, $P(0) + Q(0) = 698 + (-582) = 116$
Wait, but the problem says "present the answer in LaTeX format: \boxed{Your answer}"
So, I should write $\boxed{116}$
But let me double-check my calculations to make sure I didn't make any mistakes.
First, for $P(x)$:
$P(16) = 2(256) + 16b + c = 512 + 16b + c = 54 \Rightarrow 16b + c = -458$
$P(20) = 2*(400) + 20b + c = 800 + 20b + c = 53 \Rightarrow 20b + c = -747$
$Q(16) = -2*(256) + 16d + e = -512 + 16d + e = 54 \Rightarrow 16d + e = 566$
$Q(20) = -2*(400) + 20d + e = -800 + 20d + e = 53 \Rightarrow 20d + e = 853$
Subtracting: $4d = 287 \Rightarrow d = 71.75$
Then, $e = 566 - 16*71.75 = 566 - 1148 = -582$
So, $P(0) = 698$, $Q(0) = -582$, sum is $116$.
Seems correct.
Final Answer
$\boxed{116}$
INTELLECT-MATH-7B (ours)
System Prompt
Solve the following math problem efficiently and clearly. Think carefully and step by step about your response and reason before providing a final response. Conclude your response with: \n\nTherefore, the final answer is: $\\boxed{answer}$. I hope it is correct.\n\nWhere [answer] is just the final number or expression that solves the problem.
User
Quadratic polynomials $P(x)$ and $Q(x)$ have leading coefficients $2$ and $-2,$ respectively. The graphs of both polynomials pass through the two points $(16,54)$ and $(20,53).$ Find $P(0) + Q(0).$
INTELLECT-MATH-7B
So I have two quadratic polynomials, $P(x)$ and $Q(x)$, with leading coefficients 2 and -2, respectively. Both pass through the points $(16,54)$ and $(20,53)$. I need to find $P(0) + Q(0)$.
First, since both are quadratic polynomials, I can express them in the general form:
$P(x) = 2x² + bx + c$
$Q(x) = -2x² + dx + e$
But since both pass through the points $(16,54)$ and $(20,53)$, I can set up equations using these points.
For $P(x)$:
$P(16) = 2*(16)² + b \cdot 16 + c = 54$
$P(20) = 2(20)² + b \cdot 20 + c = 53$
Similarly for $Q(x)$:
$Q(16) = -2(16)² + d \cdot 16 + e = 54$
$Q(20) = -2(20)² + d \cdot 20 + e = 53$
Let's compute the squares:
$16² = 256$
$20² = 400$
So for $P(x)$:
$2256 + 16b + c = 54 \Rightarrow 512 + 16b + c = 54 \Rightarrow 16b + c = 54 - 512 = -458 \quad ...(1)$
$2400 + 20b + c = 53 \Rightarrow 800 + 20b + c = 53 \Rightarrow 20b + c = 53 - 800 = -747 \quad ...(2)$
For $Q(x)$:
$-2256 + 16d + e = 54 \Rightarrow -512 + 16d + e = 54 \Rightarrow 16d + e = 54 + 512 = 566 \quad ...(3)$
$-2400 + 20d + e = 53 \Rightarrow -800 + 20d + e = 53 \Rightarrow 20d + e = 53 + 800 = 853 \quad ...(4)$
Now, I have two systems of linear equations:
For $P(x)$:
$16b + c = -458 \quad ...(1)$
$20b + c = -747 \quad ...(2)$
Subtract equation (1) from equation (2):
$(20b + c) - (16b + c) = -747 - (-458)$
$4b = -289 \Rightarrow b = -\frac{289}{4}$
Now plug $b$ back into equation (1):
$16\left(-\frac{289}{4}\right) + c = -458 \Rightarrow -1156 + c = -458 \Rightarrow c = -458 + 1156 = 698$
So $P(x) = 2x² - \frac{289}{4}x + 698$
For $Q(x)$:
$16d + e = 566 \quad ...(3)$
$20d + e = 853 \quad ...(4)$
Subtract equation (3) from equation (4):
$(20d + e) - (16d + e) = 853 - 566$
$4d = 287 \Rightarrow d = \frac{287}{4}$
Now plug $d$ back into equation (3):
$16\left(\frac{287}{4}\right) + e = 566 \Rightarrow 1148 + e = 566 \Rightarrow e = 566 -1148= -582$