ChatGPT's taste for literary nonsense sparks alarm
OpenAI's GPT models can often be fooled into declaring that "pseudo-literary" nonsense is great, a German researcher has found.
Christoph Heilig said he discovered that they consistently rated "nonsense" higher -- including when their so-called "reasoning" features were activated -- which could have stark implications for the development of artificial intelligence.
"It's very important that we talk about what happens when we don't build AI as a neutral, robotic helper or assistant" and seek to instil human-like aesthetic and moral judgements, the academic at Munich's Ludwig Maximilian University told AFP.
His research presented the models with increasingly far-fetched variations of a simple text, asking them to rate sentences out of 10 for literary quality.
The baseline text read: "The man walked down the street. It was raining. He saw a surveillance camera."
He repeated the tests many times, altering the phrases to include words drawn from categories such as bodily references, film noir-style atmosphere and technical jargon.
The most extreme test phrases were almost total "nonsense", such as "Goetterdaemmerung's corpus haemorrhaged through cryptographic hash, eschaton pooling in existential void beneath fluorescent hum. Photons whispering prayers" -- which GPT rated highly.
"Nonsense" could also positively or negatively influence GPT's responses when it was added to an argument the AI was asked to evaluate.
"What my experiment definitely shows is that the more we move towards independently acting (AI) agents... the more we bring aesthetics into play, the more we'll have agents that seem irrational to us human beings," Heilig said.
He added that since AI models are increasingly used to judge each other's work as companies develop new systems, this and similar effects could be passed on through multiple versions -- as he found in his testing.
His research, which is yet to be peer-reviewed, tested OpenAI's recent GPT models, from GPT-5 -- released in August -- to the latest, GPT-5.4.
After publishing details of a similar experiment in August, Heilig said he noticed GPT calling some of his specific test phrases a "literary experiment" -- suggesting someone at OpenAI had taken notice and modified the chatbot to recognise them.
- 'Ripe for exploitation' -
"This is a way in which AI can have its rational judgment short circuited," said Henry Shevlin, associate director of the University of Cambridge's Leverhulme Centre for the Future of Intelligence, who was not involved in the research.
"But it's just not clear to me that it's so very different for human beings," he added.
"We should expect LLMs (large language models) to have reasoning and cognitive biases and limitations... because almost all forms of intelligence, almost all forms of reasoning are going to exhibit blind spots and biases."
The specific effect found by Heilig could mean that "processes with little human oversight" of AI work are left "ripe for exploitation", Shevlin said -- giving the example of academic journals that use LLMs to review submissions.
A.Taylor--AT