What are the limits of the AI mathematician? - FT中文网
登录×
电子邮件/用户名
密码
记住我
请输入邮箱和密码进行绑定操作:
请输入手机号码,通过短信验证(目前仅支持中国大陆地区的手机号):
请您阅读我们的用户注册协议隐私权保护政策,点击下方按钮即视为您接受。
人工智能

What are the limits of the AI mathematician?

If models can learn to master complex calculations, they could solve problems that have so far eluded us
00:00

{"text":[[{"start":6.74,"text":"The writer is a theoretical cosmologist at the University of Cambridge and director of the Infosys-Cambridge AI Centre "}],[{"start":15.13,"text":"Mathematics was once assumed to be relatively safe from the incoming juggernaut of artificial intelligence automation. Chatbots might be able to generate text, code and images on demand, but the deep reasoning required for mathematics was supposedly out of reach. The gold medals that OpenAI and DeepMind recently achieved at the International Mathematical Olympiad have therefore left maths professors like me feeling suddenly a little less safe. "}],[{"start":48.28,"text":"Is AI about to do to mathematical proofs what it’s already doing to coding? After all, the two have clear similarities: both are highly structured “languages” with clear conventions and restricted “dictionaries”. Both have large corpora of examples on which AI can be trained with known solutions.  "}],[{"start":72.57,"text":"Yet while the results from cutting-edge AI maths models are impressive, there is another class of maths that generative AI still struggles with: simple computation. Ask “what is 5.11 minus 5.9?” and the answers vary. This morning, OpenAI’s latest GPT5 model gave me the correct answer of -0.79. But phrase the question as part of a calculation and you may receive a different answer."}],[{"start":102.72,"text":"What should we make of AI models that can outperform high school-age Olympiad competitors but cannot always add or subtract to primary school level? To understand this, it’s helpful to think about what it means to be good at maths."}],[{"start":117.77,"text":"The way maths is taught is by showing students a problem, demonstrating the method required to solve it and then assigning examples. Weaker students require numerous examples and sometimes end up simply memorising the method without understanding it. The strongest students need only one or two examples to master the concept and apply it to new problems."}],[{"start":142.35,"text":"The ability to conceptualise and generalise distinguishes the best mathematicians. Good mathematicians solve hard problems; great ones find ways to make the hard problems easy."}],[{"start":156.35,"text":"The strengths of AI models lie in their speed and ability to “practise” at extremely high volumes. This means they can solve very difficult problems that bear some resemblance to things they have been shown before but may struggle when given something new. This is particularly a problem for theoretical maths. The number of examples available for training drops as you move towards more advanced problems."}],[{"start":183.04999999999998,"text":"These are well-known issues with neural networks. They are great at interpolation (generating answers that are “between” things they’ve seen before) and bad at extrapolation (generating answers that fall outside their training set)."}],[{"start":197.63,"text":"In maths, this is made extra difficult by problems that sound similar. Consider: “What is the maximum number of cubes of volume 1 that you can fit in a cube of volume 64?” and “What is the maximum number of spheres of volume 1 that you can fit in a sphere of volume 64?”. They sound alike but one is simple to solve (cubes fit together neatly in a 4x4x4 block), while the other is fiendish (spheres do not stack nicely)."}],[{"start":231.82,"text":"What this means is that AI use in applied mathematics and cosmology is still limited. We can take things we already know how to do and use AI to automate them. But so far, calculation has seen little advancement."}],[{"start":247.35999999999999,"text":"It is possible, however, that more training will solve the problem without extrapolation ever being required. If AI models can be fed enough complex calculations they could perhaps solve problems that have so far eluded us without the need for any human-level inspiration."}],[{"start":267.84,"text":"The question being asked in my field is: “How powerful is an extremely fast, extremely well-trained, unthinking mathematician?” We are in the process of finding out."}],[{"start":null,"text":""}],[{"start":287.28,"text":""}]],"url":"https://audio.ftmailbox.cn/album/a_1755773104_9805.mp3"}

版权声明:本文版权归FT中文网所有,未经允许任何单位或个人不得转载,复制或以任何其他方式使用本文全部或部分,侵权必究。

“蓝领繁荣”的真相

技术型体力劳动带来些许亮点,但恐不足以扭转艰难的就业形势。

Anthropic如何在AI编程取得突破并撼动商业格局

新的人工智能驱动工具正在压缩软件开发的时间与成本,并对从法律到广告等行业构成威胁。

并非所有软件都面临相同的AI威胁

从安全服务到能够彻底改造的公司,许多企业或许都能在“AI末日大决战”中存活下来。

李开复:为何中国将在消费级AI领域击败美国

这位中国人工智能先驱谈到了AI领域两大强国之间的竞争,以及企业为何需要更积极主动地采用AI技术。

据信俄罗斯间谍航天器已拦截欧洲关键卫星通信

欧洲安全官员认为,莫斯科正将未加密的欧洲通信内容作为攻击目标。

印度欢迎特朗普的“协议”,但回避讨论俄油禁令

分析人士对美国总统声称莫迪已承诺停止购买俄罗斯原油一事深表怀疑。
设置字号×
最小
较小
默认
较大
最大
分享×