The IDA group is a leading centre of excellence for multidisciplinary work involving Artificial Intelligence, Data Science, and Statistics. Brunel University.
Happening now! Brunel University 's Computer Science PhD Symposium'25. Keynote Speaker Dr Michael Schlictkrull @michael_sejr who introduced the novel NLP task of finding and summarising indicators of reliability and trust, rather than truthfulness: automated source criticism.
I am so sorry that the U.S. is letting down our friends and allies. Broad tariffs, implemented not just against adversaries but also steadfast allies, will damage the livelihoods of billions of people, create inflation, make the world more fragmented, and leave the U.S. and the world poorer. AI isnβt the solution to everything, but even amidst this challenging environment, I hope our community can hold together, keep building friendships across borders, keep sharing ideas, and keep supporting each other.
Much has been written about why high, widespread taxes on imports are harmful. In this letter, Iβd like to focus on its possible effects on AI. One silver lining of the new tariffs is that they focus on physical imports, rather than digital goods and services, including intellectual property (IP) such as AI research inventions and software. IP is difficult to tax, because each piece of IP is unique and thus hard to value, and it moves across borders with little friction via the internet. Many international AI teams collaborate across borders and timezones, and software, including specifically open source software, is an important mechanism for sharing ideas. I hope that this free flow of ideas remains unhampered, even if the flow of physical goods is.
However, AI relies on hardware, and tariffs will slow down AI progress by restricting access to it. Even though a last-minute exception was made for semiconductors, taxing imports of solar panels, wind turbines, and other power-generation and -distribution equipment will diminish the ability to provide power to U.S. data centers. Taxing imports of servers, cooling hardware, networking hardware, and the like will also make it more expensive to build data centers. And taxing consumer electronics, like laptops and phones, will make it harder for citizens to learn and use AI.
With regard to data-center buildouts, another silver lining is that, with the rise of generative AI,Β data gravity has decreasedΒ because compute processing costs are much greater than transmission costs, meaning itβs more feasible to place data centers anywhere in the world rather than only in close proximity to end-users. Even though many places do not have enough trained technicians to build and operate data centers, I expect tariffs will encourage data centers to be built around the world, creating more job opportunities globally.
Finally, tariffs will create increased pressure for domestic manufacturing, which might create very mild tailwinds for robotics and industrial automation. As U.S. Vice President J.D. VanceΒ pointed outΒ in 2017, the U.S. should focus on automation (and education) rather than on tariffs. But the U.S. does not have the personnel β or know-how, or supply chain β to manufacture many of the goods that it currently counts on allies to make. Robotics can be helpful for addressing a small part of this large set of challenges. Generative AIβs rate of progress in robotics is also significantly slower than in processing text, visual data, audio, and reasoning. So while the tariffs could create tailwinds for AI-enabled robotics, I expect this effect to be small.
My 4-year-old son had been complaining for a couple of weeks that his shoes were a tight fit β he was proud that heβs growing! So last Sunday, we went shoe shopping. His new shoes cost $25, and while checking out, I paused and reflected on how lucky I am to be able to afford them. But I also thought about the many families living paycheck-to-paycheck, and for whom tariffs leading to shoes at $40 a pair would mean they let their kids wear ill-fitting shoes longer. I also thought about people Iβve met in clothing manufacturing plants in Asia and Latin America, for whom reduced demand would mean less work and less money to take home to their own kids.
I donβt know what will happen next with the U.S. tariffs, and plenty of international trade will happen with or without U.S. involvement. I hope we can return to a world of vibrant global trade with strong, rules-based, U.S. participation. Until then, letβs all of us in AI keep nurturing our international friendships, keep up the digital flow of ideas β including specifically open source software β and keep supporting each other. Letβs all do what we can to keep the world as connected as we are able.
[I had written this letter before the 90 day pause on the tariffs, but am sharing this here since many of the points are still relevant depends on what happens next.]
Original text: https://t.co/fNyTqzABWy
New benchmark for deep research agents! An agent that is creative and persistent should be able to find any piece of information on the open web, even if it requires browsing hundreds of webpages. Models that exercise this ability are like a frictionless interface to the internet.
BrowseComp, which stands for βBrowsing Competitionβ, has questions that leverage the asymmetry of verification. Humans start from a piece of information, and then write a question in such a way that it would be challenging to find the answer, but easy to verify the answer. This is an example of a BrowseComp style question:
Give me the paper published in EMNLP between 2018-2023 where the first author did their undergrad at Dartmouth and the fourth author did their undergrad at UPenn. (Answer: Frequency Effects on Syntactic Rule Learning in Transformers)
BrowseComp can be seen as an incomplete but useful benchmark for browsing agents. While BrowseComp sidesteps challenges of a true user query distribution like generating long answers or resolving ambiguity, it measures important core browsing abilities of persistence and creativity in finding information.
As a loose analogy, models that solve programming competitions like CodeForces demonstrate high coding capabilities that likely but not necessarily generalize to other coding tasks. Similarly, to solve BrowseComp, the model must be proficient at locating hard-to-find pieces of information, but it's not guaranteed that this generalizes to all browsing tasks.
A cool property of BrowseComp is that performance seems to scale with the amount of test-time compute spent trying to solve it!
We are bringing back Stanfordβs CS 25 Transformers Course (https://t.co/Yvq4AcLiBV) today! Itβs open to everybody!
This is one of @Stanford's hottest seminar courses. We open the course through Zoom to the public. Lectures start today (Tuesdays), 3-4:20pm PDT, at https://t.co/hhHwpf9L7h. Talks will be recorded and released ~2 weeks afterward.
Each week, we invite folks at the forefront of Transformers research to discuss the latest breakthroughs, from LLM architectures like GPT and Gemini to creative use cases in generating art (e.g. DALL-E and Sora), biology and neuroscience applications, robotics, and so forth!
Past speakers have included folks from @OpenAI, @GoogleDeepMind, @nvidia, @Meta, @AnthropicAI, etc. such as @karpathy, @geoffreyhinton, @DrJimFan, @ashVaswani, @_jasonwei, @hwchung27, @xiao_ted, @janleike, @YejinChoinka, @douwekiela, and many more! [Attached photos with some of themπ]
Our class has an incredibly popular reception within and outside Stanford, and over a million total views of our recordings [https://t.co/Eb4hKTZrbB] on YouTube. Our class with @karpathy was the second most popular YouTube video [https://t.co/vVMNYsKsEx] uploaded by Stanford in 2023 with over 750k views!
Also, livestreaming and auditing are available to all. Feel free to audit in person or by joining the Zoom livestream.
We also have a Discord server [https://t.co/vlDVm30x5F] (over 5000 members) used for Transformers discussion. We open it to the public as more of a "Transformers community". Feel free to join and chat with hundreds of others about Transformers!
Thanks to my co-instructors @DivGarg9@_KaranPS_@boson2photon Jenny Duan and the course's faculty advisor @chrmanning!
More details: https://t.co/Yvq4AcLiBV
@StanfordAILab@stanfordnlp@StanfordHAI@agihouse_org
#AI #ArtificialIntelligence #ML #DeepLearning #NLP #NLProc #Transformers #Stanford #Education #Innovation #TechEd #Community #naturallanguageprocessing
π πΊπππππ πΊπππππ π¨ππππππππππππI have the great pleasure to announce the first Lake Como Summer School on "π³ππππ πͺπππππππππ πππ π»πππππππππ: π¨π ππππππ π»ππππ πππ πππ π³ππππ πΊπππππ" - all infos below
RESEARCH FUNDING AVAILABLE: APPLY NOW
Our research fund is now open for applications, on the theme of βBuilding the digital goodβ.
Find out more here: https://t.co/nCKS17weLJ
An introductory talk by Christopher Manning @chrmanning on βLarge Language Models in 2025 β How much understanding and intelligence?β at the Workshop on a Public AI Assistant to Worldwide Knowledge at @Stanford, covering 3 eras of LLMs, RAG, Agents, DeepSeek-R1, using LLMs, β¦.
In todayβs competitive product landscape, scientific understanding of models often lags behind speed of model deployment. If the goal is to train a deployable model (especially when bottlenecked by compute), it totally makes sense to make several changes at a time without studying the effect of each one in isolation. Although this is not ideal, the bright side is, when we find time to go back and ablate how each component contributes to model training, it can be a gold mine of interesting ML phenomena. Like an adventurer going onto an unexplored planet filled with undocumented and never-seen-before alien life forms
The final admission that the 2023 strategy of OpenAI, Anthropic, etc. (βsimply scaling up model size, data, compute, and dollars spent will get us to AGI/ASIβ) is no longer working!
New piece out!
We explain why Fully Autonomous Agents Should Not be Developed, breaking βAI Agentβ down into its components & examining through ethical values.
https://t.co/HV4kHpPdWz
With @evijit@SashaMTL and @GiadaPistilli (1/2)
i'm comically impressed that people are coping on deepseek by spewing bizarre conspiracy theories -- despite deepseek open-sourcing and writing some of the most detail oriented papers ever.
read. replicate. compete.
don't be salty, just makes you look incompetent.
We are excited to announce the launch of Datasets and Benchmarks Track! It aims to serve as a venue for the presentation of high-quality datasets & benchmarks & tools that are essential for advancing research and applications in data science, data mining, data-centric ML.
Somewhat meta but there is a dopamine cycle in doing AI research that is pretty interesting
Every day you wake up and you think about what experiment to run. You think thing X matters so you decide to improve it or ablate it. Then you write the code and pay some compute to find out the answer. Then you get dopamine or confusion depending on whether the result matched expectations
For experiments that are small scale you can get several reward signals quickly and develop your intuition. Other times you make a big bet that is non-obvious or controversial and go through a period of hard work, dopamine starvation, and uncertainty to do it. If it works out it's a immense high but if it fails it's natural to be consumed by helplessness
Sometimes there is an element of ego. You may believe in X or not, and keep working to find evidence to support it or excuses for why it doesn't apply in that experiment. There can be an underlying urge to either to prove your abilities or to show that youβre still relevant in the quickly changing landscape. Science is supposedly objective but the reality is that there is a huge social aspect to it, and respect of peers is an valuable reward signal to seek
Thatβs how I feel about it, at least. Every day is a small journey further into the jungle of human knowledge. Not a bad life at all, one iβm willing to do for a long time
Also cool if your research gets deployed into product
This Saturday, we will pause to reflect on a year since the passing of @BZephaniah, our beloved Professor of Creative Writing.
When asked about his legacy, he simply said: "love". We honour that and extend our thoughts and love to all who feel his loss most deeply. π€π