OpenAI releases reasoning AI with eye on safety, accuracy
ChatGPT creator OpenAI on Thursday released a new series of artificial intelligence models designed to spend more time thinking -- in hopes that generative AI chatbots provide more accurate and beneficial responses.
The new models, known as OpenAI o1-Preview, are designed to tackle complex tasks and solve more challenging problems in science, coding and mathematics -- areas where earlier models have been criticized for inconsistent performance.
Unlike their predecessors, these models have been trained to refine their thinking processes, try different methods and recognize mistakes before they deliver a final answer.
The new release comes as OpenAI is raising funds that could see it valued around $150 billion, which would make it one of the world's most valuable private companies, according to US media.
Investors include Microsoft and Nvidia, and the funding could also include a $7 billion investment from MGX, a United Arab Emirates-backed investment fund, The Information reported.
OpenAI CEO Sam Altman hailed the models as "a new paradigm: AI that can do general-purpose complex reasoning."
However, he cautioned that the technology "is still flawed, still limited, and it still seems more impressive on first use than it does after you spend more time with it."
OpenAI's push to improve "thinking" in its models is a response to the persistent problem of "hallucinations" in AI chatbots.
The term refers to their tendency to generate persuasive but incorrect content, a flaw that has somewhat cooled the excitement over ChatGPT-style AI features among business customers.
"We have noticed that this model hallucinates less," OpenAI researcher Jerry Tworek told The Verge.
But "we can't say we solved hallucinations," he added.
The Microsoft-backed company said that in tests, the models performed comparably to PhD students on difficult tasks in physics, chemistry and biology.
The models also excelled in mathematics and coding, achieving an 83 percent success rate on a qualifying exam for the International Mathematical Olympiad, compared to 13 percent for GPT-4o, OpenAI's most advanced general-use model.
OpenAI said that the new reasoning capabilities could be used by healthcare researchers to annotate cell sequencing data, by physicists to generate complex formulas, or by computer developers to build and execute multistep designs.
The company also said that the models survived rigorous jailbreaking tests and could better withstand attempts to circumvent their guardrails.
OpenAI said its strengthened safety measures also included recent agreements with the US and UK AI Safety Institutes, which were granted early access to the models for evaluation and testing.
P.Hernandez--AT