OpenAI believes human-like SuperAI is coming sooner than expected, plans to control, capitalise on it

OpenAI proposes a bold new method for controlling human-like SuperAI. Ilya Sutskever and his team at OpenAI believe they can use a less sophisticated but solidly built model like GPT-2 to guide a more advanced model, like GPT-4 or beyond, to behave properly.

Despite recent internal shakeups at OpenAI, the Superalignment team, led by Ilya Sutskever, remains steadfast in its mission to develop strategies for steering and regulating superintelligent AI systems. This team, formed in July, is tackling the complex challenge of aligning AI models that surpass human intelligence. While some sceptics argue that the focus on superintelligent AI is premature, the Superalignment team is actively exploring governance and control frameworks to address the potential risks associated with highly intelligent systems, as reported by TechCrunch.

The Superalignment team, which currently comprises Collin Burns, Pavel Izmailov, and Leopold Aschenbrenner, presented its latest work at the NeurIPS conference. The approach uses a less sophisticated AI model (e.g., GPT-2) to guide a more advanced model (e.g., GPT-4) toward desired behaviours and away from undesirable ones. In this analogy, the weak model stands in for human supervisors and the strong model for superintelligent AI, which lets the team test alignment hypotheses in a controlled manner.

The team is focused on instructing AI models effectively, ensuring they follow given instructions, and verifying the safety and accuracy of generated outputs. It acknowledges the challenges of aligning models that surpass human intelligence and stresses the importance of research in addressing this critical issue.

To encourage collaboration and innovation in the field, OpenAI is launching a $10 million grant program for technical research on superintelligent alignment. The program will allocate funds to academic labs, nonprofits, individual researchers, and graduate students. Former Google CEO Eric Schmidt, a supporter of OpenAI and advocate for AI research, is contributing to the funding. OpenAI also plans to host an academic conference on superalignment in early 2025 to share and promote research findings.

The Superalignment team is committed to sharing its research, including code, with the public, in line with OpenAI’s overarching goal of ensuring AI benefits humanity safely. The involvement of Schmidt, whose commercial interests in AI have been noted, raises questions about the commercial and ethical implications of OpenAI’s superalignment research. Nevertheless, the team remains dedicated to contributing to the safety and benefit of advanced AI for the broader community.

(With inputs from agencies)
