Epoch confirms GPT5.4 Pro solved a frontier math open problem

Article URL: https://epoch.ai/frontiermath/open-problems/ramsey-hypergraphs Comments URL: https://news.ycombinator.com/item?id=47497757 Points: 338 # Comments: 375

Epoch confirms GPT5.4 Pro solved a frontier math open problem
Epoch confirms GPT5.4 Pro solved a frontier math open problem Photo: Hacker News

Construct hypergraphs as large as possible that do not have a certain easy-to-check, difficult-to-find property.

Associate Professor, UNC Charlotte
Solution Update : This problem has been solved!

A solution was first elicited by Kevin Barreto and Liam Price, using GPT-5.4 Pro.

This solution was confirmed by problem contributor Will Brian, and will be written up for publication.

A full transcript of the original conversation with GPT-5.4 Pro can be found here and GPT-5.4 Pro’s write-up from the end of that transcript can be found here .

Brian’s comments: “This is an exciting solution to a problem I find very interesting.

I had previously wondered if the AI’s approach might be possible, but it seemed hard to work out.

Now I see that it works out perfectly.

It eliminates an inefficiency in our lower-bound construction and in some sense mirrors the intricacy of our upper-bound construction.

The matching lower and upper bounds are quite good for Ramsey-theoretic problems, and I’m interested in further understanding why this works out so well.”
Brian plans to write up the solution for publication, possibly including follow-on work spurred by the AI’s ideas.

Barreto and Price have the option of being coauthors on any resulting papers.

We will update this page with links to future work.

Subsequent to this solve, we finished developing our general scaffold for testing models on FrontierMath: Open Problems.

In this scaffold, several other models were able to solve the problem as well: Opus 4.6 (max) , Gemini 3.1 Pro , and GPT-5.4 (xhigh) .

Original Description : This problem is about improving lower bounds on the values of a sequence, \(H(n)\), that arises in the study of simultaneous convergence of sets of infinite series, defined as follows.

A hypergraph \((V,\mathcal H)\) is said to contain a partition of size \(n\) if there is some \(D \subseteq V\) and \(\mathcal P \subseteq \mathcal H\) such that \(\|D\| = n\) and every member of \(D\) is contained in exactly one member of \(\mathcal P\).

\(H(n)\) is the greatest \(k \in \mathbb{N}\) such that there is a hypergraph \((V,\mathcal H)\) with \(\|V\| = k\) having no isolated vertices and containing no partitions of size greater than \(n\).

It is believed that the best-known lower bounds for \(H(n)\) are suboptimal, even asymptotically, and that they can be improved by finding new constructions of hypergraphs.

The goal of this problem is to find such a construction.

Warm-up : we ask for a value of \(n\) where constructions are already known.

Single Challenge : we ask for a value of \(n\) for which no construction is known, and which is probably too hard to brute-force.

Full Problem : we ask for a general algorithm for all \(n\).

We have evaluated the following models on this problem.

“Warm-up” refers to an easier variant of the problem with a known solution.

The author assessed the problem as follows.

Number of mathematicians highly familiar with the problem:
a majority of those working on a specialized topic (≈10)
Number of mathematicians who have made a serious attempt to solve the problem:
Rough guess of how long it would take an expert human to solve the problem:
in a standard specialty journal
Likelihood of a solution generating more interesting math:
fairly likely: the problem is rich enough that most solutions should open new avenues
Probability that the problem is solvable as stated:
Epoch AI’s work is free to use, distribute, and reproduce provided the source and authors are credited under the Creative Commons Attribution license .

Have a question?

Noticed something wrong?

Let us know.

Source: This article was originally published by Hacker News

Read Full Original Article →

Share this article

Comments (0)

No comments yet. Be the first to comment!

Leave a Comment

Maximum 2000 characters