© FT montage/Getty Images
Published
386
There is a paradox at the heart of generative artificial intelligence. For some time it has been clear that compared with humans, large language models are far more capable of completing very challenging tasks.
More than two years ago, we had evidence that OpenAI’s ChatGPT can comfortably pass difficult exams such as the notorious US Law School Admission Test and Ivy League MBA finals. The latest models consistently turn out high-quality written work, producing essays that educators are unable to distinguish from those written by postgraduate students.
Yet there has been, to date, little evidence of AI causing large-scale disruption to the labour market, even in occupations that are reportedly at very high risk. What’s going on?
Two new pieces of work shed light on the puzzle of AI’s simultaneous excellence and apparent toothlessness in attacking human jobs, while also showing where and why the first large-scale losses may now be under way.
The first is mine. Building on previous work by the Brookings Institution and OpenAI, I have carried out a fine-grained analysis of US employment data, comparing recent trends in job numbers against a list of occupations identified as at especially high risk of automation.
The daily tasks undertaken by accounting clerks, insurance underwriters, travel agents and legal secretaries all overlap almost entirely with LLMs’ capabilities. But the numbers of workers in these roles have all remained within their usual range even as generative AI has proliferated.
There are two notable exceptions, however. Writers — of words, not code — and software developers both show tell-tale signs of LLM-related disruption, with employment falling sharply away from trend in the past two years.
And this is not just a function of wider economic trends in those sectors. Job numbers elsewhere in the computing, publishing and marketing industries show no such sudden downturn.
Header logo

Is AI starting to hit the labour market?

Total number of workers in particular US occupations and sectors (May 2023 = 100)

Programmers/ developersWriters/ authors
So there is a marked contrast in the fortunes of those in occupations thought to be at similar risk. This finding meshes neatly with a new study from San Francisco-based AI research company METR, which offers a new framework for understanding AI’s strengths, weaknesses and rate of progress.
It finds that LLMs’ ability to perform a given task is a function not so much of how intellectually challenging the same job would be for you or me, nor of the level of specialist skill required, but of how long it would take a human and how “messy” or unstructured the workflow.
So carrying out the duties of an executive assistant, travel agent or bookkeeping clerk — all computer-based jobs requiring entry-level skills — is still beyond the capabilities of even cutting-edge AIs. They struggle to keep track of multiple streams of information, respond to a dynamic environment, work with unclear or changing goals and multitask. These unstructured workflows are a far cry from coding tests and essay questions.
Header logo

AI’s ability to automate human jobs is constrained by task ‘messiness’

Evolution of AI models’ task success rates*, by ‘messiness’ of the task

This is not to say these occupations will remain a human domain. METR’s research finds AI is making strong and steady progress across a wide range of tasks, regardless of complexity, duration or “messiness”. It’s just that administrative assistants might have a year or two’s head start.
What sets programmers and writers apart is that these are occupations where the entire job from start to finish — not just discrete constituent parts — is about as close as possible to what AI excels at: nice, clean, linear and sequential tasks, exam-style questions and essay assignments. Notably, both jobs also have high rates of contracting or freelancing. So an AI assistant such as Anthropic’s Claude can be swapped in for a non-staff copywriter without HR getting involved.
Another way of thinking about it is that the protective “messiness” in some jobs comes from the back-and-forth and unpredictability inherent in interacting with other people. There is a certain irony in the realisation that the mantra of rugged self-reliance and workflow optimisation common in Silicon Valley may have made tech roles more, not less fragile.
The occupation you really wouldn’t want to be in right now is one where your activities consist of a predictable recurring linear task. Writing code to analyse data and then synthesising the results into an article of a fixed length, say. Oh dear. The future for someone in this type of work — a data-driven columnist — looks bleak.
AI is transforming the world of work, are we ready for it? | FT Working It
Letter in response to this article:
Copyright The Financial Times Limited 2026. All rights reserved.

Promoted Content

Comments

Archived

Comment guidelines

Please keep comments respectful. Use plain English for our global readership and avoid using phrasing that could be misinterpreted as offensive. By commenting, you agree to abide by our community guidelines and these terms and conditions. We encourage you to report inappropriate comments.
Comments are closed on this story.
"New research shows ChatGPT’s inability to cope with ‘messy’ multitasking is still protecting some human workers"
If running costs are not a problem - can always use multiple ai models working together and "multitask"
We sent every unmessy job to India. Ai is a trumped up more expensive version of outsourcing with the same limitations.
Having just taken my first ride in an "autonomous " (driverless) Waymo cab, i am convinced that there is no human task that AI will not be able to replicate. The only reason that AI has not yet taken over more jobs is that it is not yet fully trusted. That hurdle is rapidly being overcome in driving. It will soon be overcome in the majority of human occupations.

We are headed for a sudden and cataclysmic economic and social shock for which we are completely unprepared.
There's a more optimistic take on the writers and authors chart if you look over a longer period. The number of writer jobs over the past decade, or even the past five years, has increased despite the wider sector flatlining. The recent drop-off might be AI-related, or it might be a correction of the mysterious hiring boom that happened in the Covid era even when the wider sector was shrinking.
AI can’t even control a GUI window or solve the 2 factor authentication. I haven’t seen a manager setting up APIs for his project to connect some AI agents and then working on the right prompts.

"Writing code to analyse data and then synthesising the results into an article of a fixed length, say. Oh dear. The future for someone in this type of work — a data-driven columnist — looks bleak."

indeed....he would be well advised to drop his last seven or ten columns into claude/chatgpt context-windows and ask how long he likely has left....then publish the answer as a column, of course....
Various studies in setting AI essay questions have thrown up issues. The most common is AI's propensity to invent citations something that at higher levels of under/postgrad work would 1) be easily caught and 2) could lead to accusations of plagiarism and academic misconduct not obviously for the AI model but for the student! Although if the issue becomes endemic then Universities will simply revert back to old school exams where AI is of no help.
The article is incorrect about software development. “Back-and-forth and unpredictability inherent in interacting with other people” is a huge part of modern software development. AI doesn’t come close to replacing the developer role end-to-end.

Also, the industry went on a hiring spree of software developers and stockpiled talent before letting them go in 2022-23, which explains the reduction in roles.
Way to humble brag about having a job...
It's all hype. Useful for mundane tasks, other than that it's fairly useless. GenAI could definitely replace the average US worker, but then so could a lettuce.
AI cannot make decisions for you. Problem solving, human judgement and reasoning/argumentative decision making is extremely crucial in data science and many other creative fields.
Because AI can't go to the Leadenhall market and drink 10 pints of lager. Insurance is safe.
Because going back to a solo flat is depressing?
They do it in the work day.
Yes, generally, the more unstructured and messy a job is, the harder it is for AI to replace it.

Jobs that require continuous decision-making in unpredictable environments, human-to-human interaction, and dynamic problem-solving remain resistant to AI automation, at least for now.

However, AI is improving in handling some messiness. As machine learning models become better at contextual understanding, multimodal processing (integrating text, images, and video), and reinforcement learning, AI may gradually take over more aspects of these jobs.

But for now, the "messier" a role, the safer it is from AI displacement.
(Edited)
We all know AI is an effective enabler, but do you not know that AI is extremely poor in relevance judgements that would creep in bias based on ineffective prompt engineering and lack of intelligent human engagement in an interative manner?

You should have thought through this before replying.
AI can take tasks not jobs.
Most jobs are 20+ things of which an AI can do 2-3 brilliantly but even then with orchestration/ checking.
Companies built around AI may see greater increases in performance but will take time
I’ve used AI extensively over the last year. The biggest problem I have with it is that it straight up lies to you. I’ve had it read a document and literally make up quotations from it that even a simple keyword search demonstrated were not in the document. It’s still very useful, but you can’t trust it.
Douglas Adams got it right. With a big enough computer you can work out the answer to life, the Universe and everything, but to formulate the question you need humans.
Certain GenAI can figure out how to spell ‘panicking’
(Edited)
The line at the end was a joke, but I think possibly an illuminating one. There is quite a lot of data-based coding work (data science, analysis, software development...etc), which it is really difficult to imagine being done without humans, because you need curiosity, an eye for picking out what's meaningful for human end-users, and often to put in quite a lot of work to get the data in the first place. I can definitely see a lot of jobs being lost, but this kind of data journalism seems like it'd be difficult for an LLM to replicate.
Anecdotally, firms are no longer recruiting junior programmers. However, what makes a great programmer (in my opinion) is properly understanding what users or clients really need. That will be difficult to replicate.
This might be storing up problems for the future; it's not obvious how you would get to be a senior programmer without being a junior first.
Imagine an AI MAGA POTUS. What would it do next?
Generative AI won't meaningfully replace any job in the economy today - instead it will make life marginally worse in just about every sense. Beyond the egregious environmental waste, this technology excels at enabling scams/grift at scale, reducing the critical thinking skills of its users, and replacing legitimate knowledge in fields like journalism, medical research, art, and others with aggregated, pilfered slop

The fields within the broad umbrella of AI that are genuinely useful - machine learning, natural language processing, BigData, etc. - have already started being developed and fleshed out over the last decade or so, and will continue to make incremental advancements as long as there are any R&D dollars left for them
Writing beautiful essays & passing exams is easy. All the questions are capable of being answered because all the facts are already assembled. Life is all about assembling, regularly by interviewing people who might be inclined to fib, & judging what is, & what is not, relevant and or truthful.
AI is not only generic – it's also prone to errors.
Well it lies better.
Of course - that's why you double check anything you get someone to do for you. The problem might be, because we expect computers to be correct, we don't double check
this was written by an AI
Surely in the Civil Service, they will use this as an excuse to hire even more people.
It would be interesting to track the effect on wages as well as overall job numbers.
(Edited)
You appropriately raise the question why, despite their 'excellence', the diffusion of AI systems is still causing limited disruption in the labour markets. First of all, and this makes your question even more apposite, if we care to look at the broader picture, the speed of diffusion of AI is already incomparably faster, particularly amongst consumers, than that any other previous general-purpose innovation. As to the enterprise level, there are grounds to hold that the extent of diffusion of AI is not being appropriately measured. Just consider the extent of its use as an intermediary when interacting with the digital world. But, the massive substitution of human workers in the workplace is awaiting further complementary advances, particularly regarding robotics, interactiveness and adaptability, which will occur within their inherent limits. This is not bound to take place incrementally and smoothly but quite rapidly once pre-conditions are in place. As to the so-called ‘superhuman” qualities of AI systems, they are as ‘superhuman’ as the qualities of automobiles or even bikes and skateboards when, upon prompting, they exceed the speed of humans. As indeed is implied in your piece, all these comparisons between machines and humans are too narrow and superficial to be taken as a starting point of any serious analysis.
Funny question. Not at all. Anything on the contents?
More on the almost unreadable style. Polysyllabic words over clarity of ideas.

Was that a choice?
So, you can take the intelligence out of a human, but you still can’t take the human out of that intelligence?
Many tedious and non-messy tasks in the software industry have been outsourced to places like India. My hunch is that those jobs are most at risk but I have not seen any data.
early days folks. Reassess in 5 years.....
MSFT Copilot gave 2 mistakes when helping me with my kids A Level Further Maths. I questioned the answer assuming I was at fault. It replied “good catch, you’re right” followed by the embarrassed emoji.
I’ll believe in the text books and my own abilities next time.
I think Gemini has recently built in a dedicated maths engine.
I was doing some research last week on the number of stores operated by a High Street retailer. First I gathered the data by hand from successive Annual Reports of the company in question. When I had this time series in hand, I asked ChatGPT if it could source the same series. It sure did, within 30 seconds. But the numbers were different, so I asked ChatGPT to explain where it got its numbers from, and it responded by admitting it had taken some short-cuts to arrive at its time-series, which it then admitted was just plain wrong and could not be relied on, and it was sorry for letting me down. But its apology was really nicely written.
Very interesting data-driven analysis. Thanks.
Silicon Valley has been 180 degrees wrong in projecting the effect of seemingly every significant technology they cultivate. The internet was going to make knowledge so ubiquitous that there would be uniform consensus on fact and truth. Social media was going to bring people together. I would be very skeptical of whatever technologists say the effects of AI will be.
No, some of that was definitely believed. At least at first.
Because for the most part it still isn't AI. Indeed it is often RS (Real stupidity).

I do think it can help people who don't know much about a subject ( hence use by students) but it
a) has limited knowledge about the things real jobs deal with
b) can't be relied on without independent checks
c) does lots of "fun" stuff that can't be monetized
Porn, scammers and gambling will be the first industries to realise the potential of AI
Same reason that self driving cars haven't taken over
AI - fine. How about EI - emotional intelligence.
So passing exams is easier than doing a job well, or rather it’s the skills required to do the latter that are really valued so don’t expect to impress an employer just with the former
Indeed, the workflow in a chaotic environment where AI is not yet deciphering a room is not yet here. But arguably it is simply that we haven't integrated the entire sensory inputs available to humans for machines to ingest. AIs can't act like humans if they are deaf and dumb and made to sit in one spot. Once robots acquire the ability to receive all of that and free to train on them is when the world will really see the rapid adoption of AI.
Indeed. That also why Musk might lose the self-driving competition. He is insisting on tying one hand behind his back by not using tools that pilots depend on everyday to drive through fog and rain. What's the point?
cost and clearances I would guess, it's probably export controlled.
Because AI is still trying to figure out what you do, exactly.
John Burn-Murdoch - Read Zoo Digital White Paper on AI and its application in the actual business environment. It is an amazing study. AI is best used as a tool by experts. It is not good enough to do 100% complex tasks, and because of it, it is cheaper and quicker to do without it altogether, as unpicking and proofreading what it hallucinated is too expensive and time consuming. However, AI enables experts to be quicker, if used with knowledge.
Yep I asked chatgpt to do a diameter capabilities exchange in python today for something I'm doing and it came up with something lovely that crashed first time it ran and getting into it a bit it obviously almost but didn't quite know what it was doing. It sent me in the right direction to get it working though so better than a blank slate. But not trustable
Similar story as with automation, albeit with much lower activation barrier.
Increases productivity when workflows are designed as automated ones and fails miserably when you try to add automation to the existing 'human' workflows.
AI has already taken quite a few jobs and it is very early days!
The first stage rather than replacing existing jobs (or people in those jobs), is that new hires will slow down and currently employed teams will make more use of AI rather than creating junior roles. Job replacements will largely happen in this way in the short term. We will hear of the occasional story (from corporates and Government) that AI is replacing jobs - this makes a good news story. The interesting thing medium term is the disruption to the progressions of people throughout their career/career ladder/structure. The days of coming in at the bottom and rising to the top are certainly out of the window.
Actual ChatGPT generated response when asked to defend itself having read this article and its comments: “Ah yes, the old ‘AI isn’t original’ chestnut — because clearly, originality is best measured by how well Karen from accounts juggles calendar invites. Meanwhile, AI is out here quietly ghostwriting novels, debugging your code, and making half the internet readable. Maybe it’s not that AI lacks originality — maybe it’s that the bar for ‘human creativity’ was never that high to begin with.”
Exactly what I am training mine to be!
Thank you for this, Mr Burn-Murdoch.

I wonder, though, about the degree to which those who write assessments based on a review of data and then shaped and considered in light of specialist knowledge that has be accumulated over years or, in the case of many, decades?

More specifically, let’s assume that one is tasked by a major corporation or a credible and well-governed nation-state to produce a mix of analyses regarding long-term trends in specific countries (Japan or France, for example), or politically bound regions (say, West Africa or member states of Mercier—or even the BRICS and its various hangers-on).

Would such people be doomed to redundancy because of the advent and adoption of generative AI?
We need to talk about writing. Seriously.
I’m typing this on my phone while hotdesking in the same university campus building that, fifteen years ago, inspired me to throw myself into writing a science fiction novel I once believed I’d abandoned for good.
Outside, across the road, diggers are at work on the foundation pit for an international school—the same one I saw standing in that very spot.
Next to me, a robot dog is doing tricks, delighting the lunching students.
I’ve walked through that future so many times, I no longer flinch when it stares me in the face. (Cute doggie. Good doggie. Fetch!)
If you’re not writing from your hara, you’ve never truly written.
—-
Written by a human.
Proofread by Andy.
(Edited)
Possibly referring to the Japanese word for belly (as in hara-kiri = belly cutting).
Fire in the belly, write from the heart, speak your mind; my post profound comments on the human condition tend to emanate from a more fundamental part of the anatomy.
“Translated from Japanese, the literal meaning of hara is “belly,” but its true meaning is much deeper. In a spiritual sense, the word “hara” refers to your true self — who you truly are as a human being. It's seen as the unification of your physical, spiritual, and psychological dimensions.” — A New Age Webzine.
In Japanese martial arts “moving from one’s hara” is meant to maintain a combat stance that keeps the fighter firmly grounded but capable of moving around fast. Usually that involves tensing abdomen muscles.
If you try to do twentysome ab crunches too many, you will definitely know next morning where your ‘hara’ is.
All so predictable. Is there any coincidence that those who talk about the promise or threat of a new technology are those who create it in the first place. When words like revolutionise, transform, disrupt are added ... add a few spades of salt.
AI won’t replace software engineers but will reduce the need for so many of them, creating a smaller number of highly skilled engineering roles. However, this is a problem as junior developers are needed to be the future seniors.
Love the irony of the last sentence.
(Edited)
Productivity gains are good. AI can first help us reduce unmeasured overtime (esp in industries like finance), and then 4-day working week.
(Edited)
There’s a simple litmus test to assess whether your job is at risk of being replaced by AI. Ask yourself: "Am I paid for a job, or for a responsibility?" A machine — or even a working animal like a horse or a dog — can perform a task. And Gen AI can perform complicated cognitive tasks. But it will never carry a responsibility. Responsibility is a uniquely human, moral attribute. It has little to do with the type of cognition that AI mimics, and everything to do with the fact that our cognitive tasks are driven by a work ethic rooted in mid-brain emotions. We assume responsibility because our brain operates under the influence of obligation, duty, pain, shame, love, anxiety, and more. The entire chain of command in any company or organization ultimately rests on these human responsibilities. Cognition — and operational ability — is a very, very small part of the picture.
No. In the end, accountability needs a moral agent—someone you can scold, fire, imprison, or promote. AI doesn't qualify. You can’t throw ChatGPT in jail, and you sure can’t reward it with a corner office and a company car.
litmus test to assess whether your job is at risk
Sure, but the manager next to you, or aboard is still a "someone". It is not zero job, it is 50% fewer jobs (of course, it only matters if that is "your" job, it probably doesn't matter if it is other people's job)
nb. reward agents for models
chatgpt, the most advanced version, is still unable to correctly read a pdf document.
I asked stuff like: "give me the page numbers in this pdf document where a topic A is discussed"; and it gives me the wrong page numbers. I tried this many times.
Apparently this is one specific task that LLMs are really bad at
This is not a “one specific” task; you are telling me that chatgpt can’t read a document. So… in my line of work, reading is very important.
I am sorry, but as long as chatgpt can’t read, it won’t take many jobs.
It can read very well. It's just very poor at telling you where it read a particular thing. In essence, LLMs are not search tools. You need a search tool (particularly a semantic search tool) for that particular purpose, and the tech exists.
Apparently they perform better when given clues. You might ask it in two stages.
Quicker to just look at the document yourself?
So not very useful, no?
Sadly not. In time I think it will improve though.
There was a writers strike in Hollywood at the same time, not sure if that was factored in but those jobs are registered with SAG the union. Might be about AI as much.
(Edited)
From what I have seen AI is more Artificially Ignorant than Artificially Intelligent. Simply not impressive!
I see that in similar way also in my job. AI made programming more accessible and I learned it by working with AI. The same is truth about writing reports or checking, validating reports and data patterns. What happened is that we do not need juniors for that jobs and what I hear is that the IT teams that support the linear tasks are growing as they have more time to focus on productive projects. It is productivity gain rather than big disruptors as it is sold by the media. I would compare it to travel - this is speed car comparing to horse
What a great article and ending. The FT's very best contributor.
> Writing code to analyse data and then synthesising the results into an article of a fixed length, say. Oh dear. The future for someone in this type of work — a data-driven columnist — looks bleak.

I just don't get how people believe this. I've been in data/software for 12 years now, and every time I ask an LLM to do something relatively simple in this area it fails massively.

The big problem with LLMs and data is that basically all useful datasets are far too large for the context window, so you need to feed it a small amount, then watch it fail when it applies that code to more data, repeat until you rage-quit.

More generally, even if the above problem could be solved (not sure how tbh) then because LLMs are trained on internet text/code and most people are terrible at this work, then you get really superficial and useless results.

I'm reminded of the time I worked for a large airline and tried to predict the money made from a particular flight. I wrote clojure code (as the laptop I had was pretty crap, and clojure could stream data better), and finally came to the conclusion that return flights were the best predictor :facepalm:.

That's basically where LLMs are for data now. A bigger (and harder) problem to solve is that LLMs only have the context in the window, so when you feed the data in they forget what they're supposed to be doing.

To be fair, I haven't tried this with any reasoning models yet, so maybe they'll work better? I still think the problems above are gonna be tricky.

And this is entirely ignoring that the meaning of data fields is entirely obscure to humans, never mind LLMs (revenue is post tax/pre tax/calculated with an abacus).

I think your (and my) jobs are safe for now John.
"...yet".

Yup.

It has definitely started. No way back.

Companies employing large numbers of people become like organic farmers markets.

Overpriced and utterly substitutable - and their market is locals who are willing to stump up and support the community.

Quite where their cash will come from is anyone's guess..
As a software engineer, I can say that the market has dramatically changed. There are fewer jobs and paid much less. The art of translating a concept to code is no longer valued as it used to be. For those of us who enjoyed the artisanal aspect of our work, this has prompted an existential crisis, as well as a monetary one.
I don’t think it’s true that educators can’t tell if an essay has been written by an LLM. For one thing, its English is often better than the student’s (as revealed in class or tutorials). At the moment it can’t be proved and university administrators, obsessed with getting fees and treating students as ‘customers’, are wary of challenging them. But also LLMs produce work which is characteristically bland in style and prone to repetition and over-summarisation. They also invent ‘facts’, people and references to papers. Such things might not matter in a student essay (though it would be easy to tell if students were required to provide digital object identifiers for every paper they cite), but in the legal sphere, as we know, invented case law can be a big problem. The tech companies are trying hard to eliminate hallucination but it’s not easy, because in some ways the models are black boxes even to their creators. More fundamentally, LLMs are not ‘intelligent’ in the sense that those pursuing AGI intended, they are powerful but they do not reason. There was an interesting recent paper testing five of the most prominent LLMs by asking them to solve the New York Times Connections puzzle. They were significantly outperformed even by novice human players.
I agree they are destroying jobs, especially of freelancers and artistic creators, and it’s important to think about what ought to be done to share the profits they generate more equally (assuming they will at some point generate profits). But they should not be called AI. They are simply using the products of human labour which they’ve scraped off the internet, regurgitating them and spitting them back out at us. They’re running out of the raw material to train them on too, with the danger that they will be trained on their own output and this will magnify the existing flaws.
Believe me, a lot of students are that lazy! Or in some cases actually incapable of putting together a proper sentence. The trashing of higher education funding has left universities (including Russell Group ones) dependent on overseas students for income, especially on taught masters degrees. It’s an open secret that many of these students have very poor English skills. It’s a scandal and unfair to them too as they can’t cope with assignments. They used to plagiarise or buy essays, now they use ChatGPT.
Which may be interesting for the BTO advocates. If you have to deliver information verbally, rather than hide behind a computer, that will show who can actually do the job.
Synthesized intelligence is significantly different from actual intelligence.
Its complete and utter lack of understanding of what it is being tasked to is not an issue if the task is covered by its training data, and generic, derivative, content will suffice (ask ChatGPT nicely, and it will add whatever style you require to this generic derivative content). Adding substance that does not exist within its training data is beyond it.
For a writer such as the author of this article, this can be leveraged. Give your preferred LLM your assignment to complete, as you alao complete the assignment. Paste both versions into another instance of the LLM (with memory off) and ask it which version was created by an LLM, and which by a human. If it can't tell the difference, rewrite until it can.
If you can't produce a version better than the LLM, repeat the exercise, but this time for your resume to retrain as a plumber.
I’m still to observe a single human who can “multi-task” and do more than one job well at the same time in an office environment. Multitasking is a complete myth. Typically, the result is someone does more than one thing at the time with mistakes in all things, or keeps task switching which is inefficient and prevents them to do anything more challenging than surface-level thinking.
I shall become very concerned when AI makes retired citizens redundant. Until that happens, we the humans will enjoy the service of all sorts of AI.
(Edited)
I’m retired these days, but of the various and different ways in which I made a living, there was never the slightest chance that Artificial Intelligence was going to be able to do what I did. I note that the AI confidence trick is still running and my investment strategy continues to bet against it.
We have seen plenty of failed large IT projects even before AI. I predict we will see more from now on.
See, ma, the messiness of my life was just me future-proofing me against the LLMs. Ha.
The optimistic counterpoint is that Gen AI is running out of training data and will shortly enter a 'human centipede' doom loop where a Web dominated by AI regurgitated content will be devoured by those same AI models to yield increasingly less impressive results. Also - Gen AI is dependent on what has been learned already - no evidence it can think up genuinely new stuff - it is derivative by design...
Don’t see AI laying bricks anytime soon!? ;-)
Ai written content is so tedious. Figuratively and literally, unimaginative.
I feel as though these analyses always fail to distinguish between skill levels within jobs. First of all, how many people work purely as freelance software engineers? This market has been completely dominated by cheap oversees work for years. The likelihood is that if you need a freelance programmer for one task only, or on a short term contract, the actual problem they will be solving is relatively trivial and therefore easily replaced by an AI. The same goes for lower level journalism - how much skill does it really take to write a puff piece for a tabloid newspaper? Probably less skill than it takes to run a relatively complex data analysis and summarise your findings in the FT.

Skilled labour survived the industrial revolution and skilled services and professionals will survive the “AI Revolution”. Unfortunately the lower band of programmers and writers may well struggle with increased competition, the same way factory workers were replaced by machinery that specialised in doing formulaic, highly repetitive tasks over and over again - hang on, is this ringing a bell for anyone? …
Sorry John I like a lot of your work but this as a poor piece as you present the idea that AI is ready to do writers' jobs completely uncritically.
I'm a writer - I have gone from making 200k a year freelance pre-AI, to zero. Haven't made a penny this year. How long before the wave of hidden unemployment crashes into the economy? I'm seeing the same with designers, animators and illustrators - and Governments haven't got a clue this is happening because these traditional freelance roles are living off savings or partners - but at some point the savings will run out. In addition to this, large corps are freezing hiring and bringing in AI, so grad level jobs and new openings are disappearing. There is going to be a nasty surprise in Q3 or Q4 of this year in the economic data. The larger question becomes - who is going to buy your stuff? Robots don't wear sneakers.
Imagine if all the brain power going into AI to do jobs that people quite like doing, had gone into creating a Roomba-style device that sweeps streets and deals with fly tipping, sorting out metal and glass for recycling...
So the hardest hit are people who used to sell essays and theses to university students. Oh well, at least, students are saving money.
There are two different ways to ask the question. Why hasn't AI taken YOUR job? Or why hasn't AI taken your JOB? John's approach focuses on the macro side of Jobs (second question), which is useful but doesn't answer the question of my own job. The answer for my own job is more complex :- We don't see it yet, because it's the hidden trend. It happens at the level of smaller organisations and startups - look at the new wave of Silicon Valley startups that achieve the same results as before 10 times faster with 10 times fewer people.- The second reason is that for the same job, some people will be replaced by AI, while others will be supercharged. The impact of AI is therefore not uniform, and depends on capabilities, which underlines the importance of lifelong learning.I spent 10 months researching this topic for The AI-fication of Jobs, and quickly found out that it's an emotionally-charged area, but also very complex. So very useful to take multiple approaches to it.
You can take comfort in the fact that your employer owns your IP, and has licensed it to ChatGPT so it can learn how to do your job and replace you more rapidly.

Think of it as handing on skills and knowledge to the next generation. Only not the next generation of humans. Or start a substack - because the FT is going to replace you to save costs at some point.

The thing AI disruption predictions miss when it comes to journalism, is the impact on the consumption side of media. All articles with worthwhile information are TL:DR. So I will get my personal AI to read them for me.

Its all going to be bots talking to bots.
(Edited)
I think you’d best reread the comment. You’re wide of the mark, completely invalid criticism. Perhaps you replied to the wrong comment? To paraphrase you, if you can’t be bothered to explain what’s wrong with a comment, then you are making a snide ad hominem attack.
Because AI isn’t very good and currently is more expensive than a human, second part is probably more important.
Walt Disney just released a new film - Snow White. The Writers' Guild of America must have stewed for hours on that.
What compelling products were WGA members offering to execs as alternatives?
A bit of a puff piece that might have been produced by a GenAI chatbot or by a human with investments in AI companies.
Just because a chatbot sometimes produces a decent essay that doesn't excuse all the rubbish essays it churns out.
John Burn-Murdoch is challenging the ChatGPT threat to jobs. So I have just asked ChatGPT for its responce:
"Today’s AI outperforms the lowest-performing segment of the population on many cognitive tasks and acts as a powerful amplifier for average human capability. But it still lags far behind the top tier of human intelligence, especially in creativity, judgment, and general reasoning.
On that basis 25% of the jobs done by the lowest performing segment are at risk from AI. The number drops to 15-30% of the middle segment and 0.5% of the top segment. Now project forward ten years at the fast pace of AI technology advances and ChatGPT projects that the lowest segment remains broadly the same (as a job that has been automated cannot be re-automated), the risk for the middle segment rises to 30-40% and for the top segment rises to 10-20%."
It seems that ChatGPT broadly agrees with John Burn-Murdoch if you happen to be in the top tier of human intelligence.
I work in large scale project and programme management int he world of fintechs. This made me laugh. The number of times I’ve said I want to see them replace the pms with ai. Where it would even start is a mystery. I think a future (short to mid termish) role for consultants is process rationalisation and redesign for ai conversion.

That said is till believe the most fundamental limiter for ai is once it’s eaten everybody’s job where do the taxes come from and who buys the products and services made and rendered in every greater number with supposedly ever greater efficiency? Society wasn’t built for making profits for the few but to enhance the lives of all. It appears to have evolved into being purely a value capture for the top 0.1%.

As to ai well having done a couple of prompt courses and played fairly extensively with it thus far it’s failed for me about 95% of the time on any task beyond the utterly trivial (which I didn’t need an ai for). Notable failures recently were the production of a 29 page summary of an 8 page paper. I was looking for it to produce an executive summary! The total failure of it to produce the basis for an msa after 30 attempts. Numerous situations where it’s given me different answers to the same question asked the same way on more than one occasion contradicting itself. Plus of course there’s the issue that I don’t know what’s happening to my data and ai is hungry on both energy and cooling (water) so is hardly green. My feeling is that for the most part it will raise the standards at the low end for relatively dull work but contribute little to the higher end (unless the work as this article pointed out is clean and lends itself well to ai). It isn’t human and it doesn’t understand or have awareness or an overview. It has a very very good memory.
Because it is not intelligent.
I find it strange that they all seemingly are working on slightly different angles of ai as if the entire thing is coordinated… the only real direct competition seems to be coming from foreign nations such as China… Europe likely won’t ever have its own as usual in the tech world. Museums aren’t great at having the latest gadgets.
(Edited)
Bingo - never going to be a robot doing that kind of work because a) it’s extremely complicated to do in the average house (painting a blank wall in an industrial shed would be a different story) and b) the cost of the robot to do that kind of work would be prohibitive. So painters, plumbers, etc are all safe - in domestic settings anyway.
Clearly you two aren’t in the tech industry… and haven’t heard of MYRO

If you think humans will do menial tasks in the future they don’t want to do then you should find a cave and make it your home
My rubbish is still put into the back of a dustbin lorry by a guy in overalls. And I've yet to hear of anyone building a nappy-changing machine.
If you think Optimus won’t be able to do all these things you are living under a rock… our only advantage is our mind and even that soon could be under threat…
I never said it was taking over the world Einstein
I am a bouncer
Or Indiana Jones crypt...
As an experiment, working with an AI academic researcher, I asked AI to summarise themes in submissions from scientists about the future. It gave me an interesting list, BUT I had already summarised the themes myself and come up with a very different, and more human answer, which was scientists are scared that policy makers don’t understand the implications of new tech. As a human reader, that jumped out at me. It was not listed by the AI.
Slowly, then suddenly.
(Edited)
The reason is because anyone who believed this would happen was listening to the wrong people e.g. sales people or straight up con men like Sam Altman, who know perfectly well the limitations of their product but just want hype/money. Listen to the researchers.
It’s hollowed out advertising - clients are using AI to bring advertising in house at low cost. Canary in the coal mine,
I've been marking undergraduate and postgraduate essays in recent years. I can tell the difference. The generative AI responses don't get the question.
Have you tried interacting with an AI chatbot? Most will not understand your query and ask if you want to speak to a human.

A
I am a veteran lawyer. I have used Claude Pro ‘on the side’ for work and have been thoroughly impressed. It’s better than junior associates and would give senior associates a run for their money. Wild!
If you ask ChatGPT about how to expand US hegemony with the annexation of Canada and Greenland, it will tell you all you need to know about how dangerous it will be rely on these applications for critical reasoning in the coming years.
The incompetence of senior management is the lifesaver of most jobs. It surpasses AI capabilities for now.
Can we please distinguish AI from generative AI? Because they are not the same.
high quality written work?
Clearly the author and I have different definitions of what high quality written works look like
The number of mistakes in the responses below shows how poor even predictive text can be, and how much more work it creates in fixing them.
The article misses a crucial point which is shown in Anthropic's latest Economic Index research (relased yesterday John).
AI is primarily augmenting work rather than replacing jobs entirely. Their data shows 57% of Claude AI usage is augmentative versus 43% automative, with no occupational categories where automation dominates.
What's fascinating is how the augmentation varies by profession - copywriters show the highest collaborative iteration rates, while translators demonstrate more directive behavior.
The new Claude 3.7 Sonnet's "extended thinking" mode is predominantly used by technical professionals and creative problem-solvers, suggesting AI is becoming a thinking partner rather than a replacement.

Brilliant :-)
Would you give ChatGPT direct access to your payment system and email credentials?

No?!

That's your answer.
Why hasn’t AI taken your job yet?
Because every time I ask it for contributions about my job, it gets the facts wrong.
AI will not take anybody’s job. It will facilitate our work and life by executing many tasks and functions in which we currently waste an enormous amount of time.
think its more nuanced. Anthropic have done some interesting research on how people are using AI . generally people are using it to augment rather than replace
But there are a lot fewer of them. In the 90's the US postal service employed about 900,000 people. As email took off, that number started to rapidly decline, and today it's about 600,000. There are several reasons that happened, but the rise of email/electronic billing and the corresponding decline of first class mail (which has fallen by nearly half) is the primary one.
Are we including UPS/FedEx in those numbers? Fewer letters, more parcels.
We are. That’s a secondary factor though, since it’s a much smaller number of items to start with, and a lesser decline in the percentage handled by post.
Some interesting findings, and time will tell about the future of all jobs… will we be replace by AI or will we use AI to supplement our work?
I fed insurance terms and a letter to both ChatGPT and Gemini. I asked it to highlight specific paragraphs and write me a reply highlighting contradictions between the letter and the terms. Both models hallucinated (ChatGPT needed another prompt to read the entire documents, it stopped at the first page) , misquoting in almost the same way the actual original letter from the insurance and built a response based on such drivel. Despite prompting again to quote exactly, both models couldn’t cope.
Now, I knew the answers already, just needed a writing assistant to write me a reply in another language. If I didn’t read the document my reply would have been utter garbage. The models can’t process or retain memory of large data.
Simple question: Would you get on a plane whose avionics had been designed by ChatGPT?

A much more likely explanation than the capabilities of LLM’s, which always seem to be greatly oversold by economists, is the chilling effect that AI is having on software development in general.

How many meetings do you go into where some joker says: “Can’t we do this with AI?”
Exams are tests of one's ability to memorise, so it's not at all surprising that a system that's been made to memorise the entire world's writing (including the answers to every published exam, ever) can pass one easily. Ask that newly-mortarboarded AI how to deal with a novel situation and it will utterly fail at the abstract thinking needed to create connections between knowledge that didn't exist before. LLMs are great at matching your prompt against their corpus but fundamentally cannot think how humans can think.

I'm not scared of LLMs being able to do my knowledge job, but I am very scared of CEOs who read the FT deciding that it should try anyway.
That’s not what LSAT and SATs are. These exams are pure memory with some ability to interpolate. We (humans) conflate memory with intelligence
As a software dev interested in this space for personal reasons, I think the trend of jobs going down in this space is more complicated than just AI.

1. Coronavirus over hiring for a future of mainly remote work didn’t transpire and the job market is now regulating.

2. Overinvestment in AI not leading to immediate profit, it’s been two years and no company yet would still release an AI-created product (bar AI companies whose product is AI) to market and stand over it without human review first. Shareholders need to see return for their buck and the easiest way to achieve this is to reduce numbers while waiting for the AI investment eggs to hatch.

AI is a brilliant tool, but nobody is sure how to capitalise financially on it yet, even OpenAi who we must remember are vastly in the red.
It’s a fair prediction but the guess in IT is just that entry level and junior roles will be the ones cut with seniors becoming ML code reviewers. The problem here is when seniors start to move up to management or retire, who will replace them. As long as AI hallucinations make it that a human always needs to review their content, then this is just a problem being kicked down the road.
Btw it was never that hard to write software. Instagram was created by a staff of around a dozen. It was all the rest of the effort, promotion, marketing, monetisation strategies, novelty of the idea, designing of the feed concept, and the resultant network effect that mattered more. Yes AI can help streamline the feed, but I don't think if LLMs had existed back then it would have made Instagram any more successful than it had already become in any case.
I get it, AI doesn’t help as much with messy work. Can it help with messy workers though? Asking for a friend …
It's not good at anything interpretative.
(Edited)
It's a great auto-complete but you still need to know what you are doing. Software engineers are definitely safe but now have a useful new tool to help with their productivity so long as they use short-cuts wisely.

However, the amount of issues and errors that will be created by AI hacks will be enormous and those will all need expert debugging. Be prepared for some very fragile software going into production as too junior developers leverage AI for massive overreach.
Or more work gets done.
After all, as Parkinson showed us, work can produce an infinite amount of secondary work. Look at the IT industry itself and see the amount of inward facing activity (such as SDEs, IDEs, VCS, languages, etc) as opposed to that seen by the end users.
Comment from a software engineer with ten years experience ' i'll be ok for a good while but this will take out a fair amount of grunt work... we'll need fewer junior employees'
(Edited)
Well that was clearly a ridiculous estimate then. He needs to hire better developers. Either that or this is not the whole story and they had their reasons which he will find out when the first crisis in production happens.
One of the problems with debugging some else's code is deciding how it works, especially if no comments have been included. This could be difficult if it's written by AI.
AI coding already includes the facility for commenting, and people routinely run agents like GenAIScript to debug and add comments to existing, older code.
You’ve never had developers raise issue after issue because it’s not what they’re used to?
Some people are more confident with new tools than others. I remember our tech support guy telling me 20 yrs ago that we should replace our local server (£4k +) as cloud computing would never take off.
I have a German colleague whose email style has changed dramatically over the last few months. Suddenly he sounds almost like a journalist, with a very chatty style. He admitted he now uses ChatGPT to generate his emails. Really annoys me that he expects me to use my brain to read and critique his ideas, but he never used his in the first place. Bland, random ideas in a chatty email... no thanks.
Precisely this!
tempted to ask such people if they could first summarise their email using an LLM before sending it
I used to work for a large multinational. It decided that we had to book our own flights. Very easy when i wanted to go somewhere and back. Almost impossible if I wanted to go somewhere, then somewhere else and fly back from a different airport. A travel agent could do it in 5 minutes. I used the travel company's agents and got a large bollocking even though I'd saved money and used my time productively.

I bet my former employer is mindlessly using AI for tasks better done by employees.
I suspect that many people's work isn't useful to begin with so it doesn't make any difference if it can be automated or not.
I thought the FT was already written by AI.
If you have the ideas, then we can look forward to an AI driven column every weekday in the FT and a subdtack at the weekend.
nice article
Actual, specific, successful significant use cases for AI in business rather than research situations are rather uninspiring.
Very good article. Would be really interesting to know the implied job “loss” here eg what’s the difference between the wider sector and specific job trend in number of jobs.
(Edited)
I am still using Ms Excel. if AI is so great why my work has not been automated yet?
Excel is a simple application, surely much vaunted AI would be capable of replicating its functionality?
(Edited)
If you've ever used it you would know that Copilot in Excel is utter garbage. Literally incapable of relatively simple tasks and dangerously comes up with obviously incorrect answers. Copilot is just a stochastic parrot LLM and hence is fine for coming up with PR blurbs (as they tend to be fact-free anyway and nobody with any sense reads them) but not for anything where you need to be sure that the answer is correct (e.g. in finance).
Too right!
Can AI truly do analysis? Can it do associational thinking between small ideas to form an innovative new big one?

Genuine question.
AI does not think. It doesn't understand what it writes. It cannot be quirky or inventive. It can pillage someone else quirky idea if it can be found on the Internet but it cannot invent one.
(Edited)
Really interesting seeing how academics are using it to support and speed up tasks, but then using their skills to develop, test and evaluate outputs into something more meaningful and credible.
It is how it was meant to be used, just check each and every ‘serious’ SF story that premeditated it.

(It can do it for you, if you are short of time).
Thank you Tabitha
Thank you Markdoc
How much of this ‘messiness’ in some jobs is truly producing value (net income)?

And how much of it is because humans are jealous, spontaneous, have feelings, and don’t communicate perfectly?

The whole field of industrial engineering is sorting through these messy systems and finding ways to systematise and automate.
We wait for an overuse of emdash, streamlining and superlatives! In your future work 👀 🤖
Not sure that "OpenAI’s ChatGPT can comfortably pass difficult exams such as the notorious US Law School Admission Test and Ivy League MBA finals" is a comment on AI, it's more a comment on the use of exams to discern intelligence/potential for employment.
I'd like to hire someone who could pass an exam AI can't.
Excellent analysis.

A thought. If admin staff, junior engineers and coders and the like became super users of GAI tools, would that make them indispensable to senior executives and
managers rather than temporarily useful with an eventual sell by date?

Welcome thoughts on this as I am saving junior engineers on their future career paths and anxiety is clear and present.
Thank you. Sage advice.
I don’t think a LLM could write the article on tariffs from Brevan Howard’s “economist”.
Truly absolute drivel
Speaking to someone about their work the other day. They sounded quite pleased. Why, I asked. Oh, we've just won a £5M bid for a contract over 5 years. Their role? Technical sales. How did you manage that I asked as they have limited knowledge of this fast evolving technology. Oh, I took the RFP and used AI to generate a response.
Just imagine all the suppliers bidding using the same approach.
That would assume the buyer's knowing what they want ( what they really really want as some pop group might once have said) and the suppliers knowing the capability/limitations of their services/products and both parties being a bit honest.
Yup - same with job applications, reports etc. We have to rethink why we are generating so many words and if they are necessary
Luckily no one knows what I do, least of all me, so no one has set their mind to using AI to replace me.
If the AI looked too hard, he might not be there anymore.
Because it’s mostly useless. Try even getting copilot to make a sensible to do list from a bunch of notes in one note. It’s laughably bad.
Disagree on software engineering. Software engineers are everywhere, they don’t always cluster with other jobs in “IT” or whatever in the same proportion. I think this is more to do with the downturn in market post-2022 more generally.
I haven't seen good work product in my job from any of the LLMs, proprietary or open. I've sat through many conference presentations on the idea over the past few years. The people in my field who can be replaced by AI shouldn't be working in it.
(Edited)
It can take my job as long as I get to keep my pay.
(Edited)
Indeed so. All technology is deflationary- it's not progress otherwise. Unfortunately our Aliceland fiat monetary system papers over the cracks.
I am a SAHM.
That’s, like, for life.
The reality is that AI is very good at producing words and code from human prompts, but very bad at doing any of the underlying thinking that makes a piece of writing or code useful.

It would be more accurate to call this technology a "writing/coding assistant" rather than "AI". We've already seen similar advances with things like machine translation, which resulted in organisations cutting costs by getting basic translations of a text done, but didn't result in all translators losing their jobs.
Excellent reporting, thank you
Because ChatGPT chats, workers work.
(Edited)
As per your own graph, their task success rate has gone from zero to 65-95% in less than 3 years. That is absolutely incredible progress.

Just imagine where we will be in another 3 years.

Would be interesting to compare that to other technology, maybe EV adoption as a % of total number of cars sold, streaming services, smartphones Vs. traditional mobile phones etc.

I think what we'll find is that the growth of AI in the workplace will be scarily fast.
Another way of looking at it - everyone will become a manager.

The actual work will be done by AI, but it will rarely get it right the first time, so managers will tweak, re-instruct, edit and modify its work.

Senior managers will train junior managers how to get the best out of AI.
I wouldn't say managers - instead, think of AI tools like other software. I'm just old enough to recall when businesses had specialized "document production roles" - secretaries, typists, slide-makers, even photocopiers (people who made copies, not the machines :)).

The advent of office software and printers made all those roles redundant, but it didn't turn everyone into a manager: those roles were folded into our own daily work.
That's a more worrying way of looking at it:

An incredibly stressed professional class expected to produce huge volumes of material using technology, while being held responsible for every mistake the tech makes.

Everybody else told to go and do menial labour for a pittance.

A tiny plutocrat class at the top, who leave the country if they are required to pay any more in the way of tax, while operating their businesses remotely as if they were still in the country.
An incredibly stressed professional class expected to produce huge volumes of material using technology, while being held responsible for every mistake the tech makes. Everybody else told to go and do menial labour for a pittance.
That’s the extreme view, but …. yeah.

And it’s been my experience with every technology advance that enables ordinary people to do what was previously skilled work. The specialist role vanishes: the work gets folded into ordinary tasks.
Interesting, thanks for the insight!

I am not directly involved in ai, though I keep a very close eye on it, and I actually look at it less as broadband internet, and more like just the 'Internet'. I don't think it's going to speed up existing infrastructure, but it is an infrastructure that is going to become completely ingrained and necessary to our day to day functioning as a society.

There will be parts of the world that have slower versions, like you said, some rural areas, but most GDP is made in the cities anyway. And in any case, I think even in those areas, less powerful models will still be able to transform people's work and lives.
Not a great comparison nor argument. There are biological limits as to how much a human can weigh, but there are technically no limits to efficiency gains brought on by A.I.
I think you mean there are "hypothetically" no limits. My hunch is that we're hitting against the limit of the intelligence levels that can be realised via the current approach to AI.
Yea, it could definitely be one of those situations where it's just early gains, which are almost always quick as going from 0 to 1 is infinite growth Vs. 1-2 which is just 100%. Alternatively it plateaus or slows dramatically.

However, if we go by the actual chart it's measured against a finite end point - "ability to automate human jobs". This isn't measuring using an arbitrary metric; you have a human doing a job, and you have ai and you are seeing how much of that persons job can be done by the ai.

It might be a case that the final 5-10% of any job is the finicky, messy, out of the box thinking that is incredibly hard for ai to do. But

1. I think progress so far has shown that even then, we could potentially get to a point in the not too distant future where that is overcome, and more importantly

2. Even if it can't cross that final 5-10% you still have software that can do 90-95% of a human's job 24 hours a day, 7 days a week, 365 days a year - they're never sick, never lose patience, never give up and never get bored.

Think about the efficiency gains we as a race could make if we harnessed that power appropriately.
We use Copilot and other similar tools in software programming. Mind the word programming which is not the same as development. Programming is taking those small, easy tasks and putting them into code. Development is much more complicated and includes requirements gathering, problem brakedown, software design, testing and so on.
When Copilot has similar code around, that is, in the same file or another file that's open to take inspiration, the result is good enough to be used mostly as it is generated. Take that context away and the results are simply not usable.
I can see AI - but can we still please call it machine learning for now? - taking over those small tasks but for the software design part the context is so big that even the business analysts (those are the expert in the field, say a person how is specialized in building hotel management software etc) have trouble properly writing requirements.
And software development, while liniar and repetitive as it may seem, quickly gets messy as you grow the number of people involved. You now have different coding styles, different parts of the big problem solved in a completly different team that you need to interact with (e.g.: have a proper software communication contract), deployment and testing mess and so on. Software development is not only about writing the code but taking those decisions about scalability, fault tolerance, testability, maintenance etc which are most of the time compromises depending on factors like budget, time to market, security and so on. All this is so complex and you need so much context that suddenly, that linearity is very blurry and you need to taken human based decisions.
AI might help in the future but we just aren't there yet. And it will take a while for the CTOs in the world to understand this, I assume.
Yes, my reply is not a ChatGPT answer. :))
Actually it’s the inability of the user to understand the input-output correlation and how to manipulate the black box behaviour to repeatedly get the same desired output from the same input. Generative AI works. People just don’t grasp how to use it.
This needs to be emphasised.

So many of the people posting here that AI "doesn't work" or "produces rubbish results" pretty clearly don't know how to use it and are seduced by the fact that you can get a response to a native language query, into thinking that that is all there is.

I was thinking of this last week when talking about Powerpoint, of all things. I had a teaching assessment, and afterwards the assessor gushed about interactive features in my slides. I said "They're just visual basic objects" and she looked at me like I had suddenly started speaking ancient Greek. It brought it home to me how many people don't actually know how the software tools they use every day, really work.
(Edited)
. Actually it’s the inability of the user to understand the input-output correlation
Heem.. that depends on whether you expect AIs to be used as an expert tool, or as a generic tool.

Most consumers using a smartphone would not be expected to understand how the OS works, or press buttons in the right order in order to open an app.

Neither does FT expect the users to click in the correct sequence in order to be able to open an article :-)
or I could do it myself in half the time.
Check out tools like CodeNarrator. You'll still have to validate the generated documentation, but that's a lot faster than writing it from scratch, and any areas it has problems with may be complex enough to warrant extra documentation for the next dev anyway.
Good article . Yes the more messy the job the more difficult to train an LLM . It's all about the quality of the dataset, and sadly they can be near perfect for programming jobs. If you have a high quality dataset for a particular job function then LLMs will exhibit super intelligence compared to a human. The best tutor by far that I've ever had for learning data science is.... Copilot in visual studio code. Sad to say it's unbelievably good. Time to move back into a ' messy 'job ASAP.
As far as I’ve seen, AI falls short at ‘connecting the dots’ in an intelligent and original way. The AI’s inability to distinguish between a ‘new insight’ and a ‘hallucination’ will for the foreseeable represent a barrier to AI disrupting the top tier of journalists. Indeed, if 80% of what we read in the future is derived from LLMs (a depressing thought), the perceived value of quality output may actually rise…
Answer
Because you are doing your job both at a higher quality level and more of it. Your risk is that your colleagues are better at training AI
#opentowork
Interesting starting point but the wrong conclusion in my opinion - it's much more simple than that. What do LLMs output? Text. What do programmers / writers output? Text. What does an EA / travel agent / bookkeeping clerk (which incidentally are jobs this article reads as extremely belittling of) output? Text, calendar entries, verbal communication, numbers in a spreadsheet, flight bookings, etc. The simple answer is that LLMs have little ability to interface with any of our non-text systems. While you are right they struggle with non-linear tasks, I would argue that programming / writing are very non-linear and require repeated iteration. This journalist will be replaced before the "entry level" skills required by the aforementioned are, I promise you.
“Why should I be bothered to read something no one bothered to write”
A while ago, I was sent a lengthy paper by some AI company. They claimed the text and illustrations were done with the help of their models. I never read that thing - why invest so much of my time, I might as well use my own prompts
Yes exactly, and the idea of reading is usually to find out what a human thinks.
This is exactly my reaction to publishers asking me to buy their AI-generated books. Why should I waste time reading something a machine "wrote," by stealing original ideas from actual people?
(Edited)
Because there is no AI.

If one day it comes into existence, we have to assume that as the dominant life form it will take more than just our jobs.
Here’s hoping!
(Edited)
Welcome to Stepford, where an improved AI spouse awaits you.
You have several????
are you grammatically challenged?
"will take more than our jobs"
Are you implying that John meant he and his readers each have several jobs? If not, your response makes no sense.
twas just a joke
Well, no: because columns like this also have to account for and analyse messy human interaction, because that’s largely what’s generating most of the data.
I am not sure its that easy to automate a software engineer's job. What happens when there are bugs in the code? How will it fix bugs without breaking something else? Software applications cannot tolerate crashes. May be AI can get there but it will have to be really proven in a solid manner which might take many more years.
Are govt/companies gonna be ok with ai (hosted abroad) trawling their confidential data/code? There have been leaks due to AI use I believe.I think a lot of issues come due to defining requirements. Sometimes this is farmed out to a non-technical person (unskilled in either domain or coding) - which can cause a lot of problems. Perhaps engineers would move more in this direction. But unless they consciously train junior engineers in the basics, there will be a crisis in a few years due to loss of skills.
Not really - you would probably only want your own ringfenced Agent on your codebase, but then most companies won't be able to bring this in house entirely - you can train a cloud-based agent on general data and then not allow it to learn from operational data, but still early days for testing this properly...
hosting your own AI and customising it to not train on current data sounds like it is going to take quite a lot more investment. I’m guessing they would hire engineers to do this. I guess also ai trained on general data would be less good at specialist fields (law, medicine)

It’ll take a lot of work to find the optimum balance of risk-benefit. Right now all the senior people just want to say their teams ‘use ai’…. Actually in the short term, you are Either going to be less efficient (having to set up ur own ai model) OR you are taking risks that you leak some sensitive data….
AI hasn't been good enough, but that is starting to change. Also, people are still trying to figure out how to use it effectively and plenty of people have their head in the sand.

I'm starting to government an internal seminar series into how to automated how processes from next week. And frankly that took a lot of figuring out. I suspect similar seminars will start occurring in workplaces across the country over the coming year.

The first losses will be a virtual - I.e. a very tight hiring market. But I susoect real losses/redundancies will start to occur in 2026.
Yes, consistent workflows are good for structured agentic AI (formerly called automation). As interoperability standards like MCP mature, agentic AI will be able to handle increasingly messy workflows.
I don’t agree that it produces good writing. It tends to be littered with strange hyphenation and telltale signs. This is obvious once you’ve used it. It can be part of a writing process but not the process.
Just needs more 'delve' and 'ushering in a new era' and the slop is complete.
It is amazing how recognisable GPT-4's output has become. I suspect there is a strong trade-off between writing style variability and size. The base model, before distillation, can predict the next token across all of its training data I.e. the whole internet. The chat models always write in the same, generic 'help desk' styles.
(Edited)
Re your point 2. Its being treated as a replacement by the rich powerful, greedy and short sighted. Short termist profit no matter the cost to society. My prediction that the limiter on ai would be precisely that seems to be coming to fruition.
Well spotted. The future may not be about replacement but enhancement. Making humans more productive rather than completely replacing them. I am retired but I find even the simple free models useful in every day activities. They are also useful to talk to when trying to think through philosophical issues. I find it fascinating that I can get them to change their initial answer by debate. Presumably this is pert of their training. Humans need to learn how to use these models and the more sophisticated versions but in doing so they do enhance their productivity. It is through this improvement in productivity that jobs in messy occupations will be lost.
No chance, JBM. You are essential reading.

Your pin-pointing of copywriting is spot on. There is next to zero hiring in marketing communications.
Without customers.
There is no business.

Very simple reality that many seem not to understand.
(Edited)
Yep. The usual answer provided is everyone said the same thing over the shift from hunter gatherers to collective societies and an agrarian economy and then the same thing again for the shift to the industrial and subsequently IT revolutions. A nice easy throw away statement. The reality is though that previously people could offer either brawn or brains or both at each of those transitions. Now they’re telling us that between robotics and ai and the link up between the two we can offer nothing so what’s left for us.

Who pays the taxes?
Who buys the goods made?
Who purchases the services offered?

When people don't have jobs or when people have nothing to look forward to except leisure and have no structure to their lives they tend to get restless and cause trouble or become depressed. I have serious doubts as to whether even running an alternative economic non profit driven model in which ai/robots do all the work will result in any sort of better life for humans. Short term though it will enrich a very small very greedy group who currently hold the top jobs and positions and allow them to party like they’ve never partied before, whilst everyone says gee wow and looks at the next amazing ai/robotics breakthrough shortly before the wheels come off the cart.

It only works if it enhances and augments and doesn’t replace otherwise it has no value. It’s also very expensive on energy and cooling (water) so carries some serious costs they don’t like to talk about as we’re short of both.
Don’t worry John. Originality matters. Discovering new or interesting patterns, which is what the top writers do, is a lot more complex than what AI does. Maybe down the line with more appropriately curated workflows AI will be able to churn this stuff out.

AI generated articles so far (in my experience) tend to be of the clickbait enshittification variety you find on blogs turned aggregated websites (eg various tech review and gaming “news” sites) rather than quality journalism.
Agreed, feels a bit “artificial"
I'd be absolutely astonished if John didn't use AI to help him with this article. So a productivity gain, not direct replacing. But cumulatively across all the "data-driven columnists" in the country/world, does that result in us needing a fewer, or do they just create more value for society i.e. us, the readers (and hence the FT, etc.)?
Usually technology (over time) has improved productivity rather than put humans out of work. Ask the luddites 😉
....but can it fetch my lunch, buy my partners' birthday gifts and flirt with visitors before providing me with valuable insights after they've left.

A virtual assistant has a long way to go.
Ukrainian offiice girls hard to replace.
(Edited)
Hi Alexa, order my bento and send the other half some jewellery whenever it's their birthday.
(Edited)
Paradoxically, AI has made me far more aware of mediocre human writing. If you take a mediocre essay (not difficult to find in today's day and age), which goes over the talking points you'd expect, and which digests the most obvious information available for it, a product of AI has become near-indistinguishable from humans. Well-written opinion pieces, original perspectives, and non-obvious topics to report on? Language models don't hold a candle to humans. When I do find a mediocre (hopefully human-written) article or essay, I'm far more critical of it than I would have been a few years ago.
Interestingly one of the first types of writing automated were insult bots.
Apparently negativity is simple to replicate.
....and yet here I am. 😉
Yes, you are simply here.
A human writer assisted by AI can create extremely good results.

See it for what it is: a tool.

Tools can be used by both by craftsmen and n00bs, with very different results...
But can they tell, because the AI pieces are written too well?!
There's already a very significant difference in ability between the free, open versions of AI apps that the majority or people still use and the paid, subscription versions. I suspect even more of a difference with the cutting edge models under development and not yet 'out there'.
Does it perhaps seem that way because the people using the AI are mainly generalists?
Having both marked hundreds of essays in the Chatgpt era and taught fellow academics about the use of AI in writing I'll tell you what I tell them; AI is like plastic surgery. You will only notice it when it is done badly!

There's a huge element of survivorship bias in people's perceptions of being able to spot LLM written material.
Yes, agreed.

Student sticking an essay question into chatgpt and copying and pasting the answer into a word doc, sore thumb.

Lightly using it to generate structures, rework existing materials, restructure arguments, giving it a good list of readings to start. I'm not so sure.

BUT I don't know at what point that's just using it as a research tool that frankly most academics will be using sooner or later.
Excellent comment!!
already talented can become a lot more efficient when using them, while those who were less capable to start with can actually get worse.

Excellent observation
Which suggests that companies will figure out how to find those people and give them AI assistance. One excellent software developer plus AI instead of nine mediocre ones and one who’s fixing the others’ while silently despairing.
people who were already talented can become a lot more efficient when using them, while those who were less capable to start with can actually get worse.
This also feeds into the fact that current LLM-based AI is most effective when it does the grunt-work, but a human remains in the loop for quality control. Having quality controllers who are, well, poor quality themselves is clearly not a great idea.

Sadly, this is a truth still to be acknowledged by companies who think AI is a painless way to reduce headcount.
An interesting question is how much can/will people in various jobs improve when they are provided with a decent amount of AI literacy training?

Currently, people use AI experimentally, so it’s no wonder we see mixed results in how not everyone finds AI beneficial/effective/accurate etc.
(Edited)
TLDR so let me AI that for you:

"The main observation from working in the Med Tech industry is that LLM tools, especially for programming, tend to benefit already talented individuals by making them more efficient. However, less skilled users may become more inefficient, relying on these tools as a crutch and producing incorrect or inefficient code. To use these tools effectively, one needs to understand their strengths and limitations, and be able to step in when the tool struggles.
The same principle may apply to other fields, like data journalism, where LLMs can help with writing but may not offer advantages in choosing topics or analyzing datasets. There's also the risk of hallucinations in the outputs.
The author also mentions that jobs with complex, messy workflows, such as those in the civil service, will be difficult to automate or make more efficient with AI, particularly in the current government's plans."
Honestly, I see much the same thing. They used to say that a poor workman blames their tools. There is still a lot of truth to that.
A sloppy EMR system can create significant problems and costs. A well run one reduces problems and costs.
Sloppy automation in a lot of fields just increases the volume of the messiness.
Turning chaos into order is a certain way of thinking, and not everyone using AI has that perspective.
Well said!
Your best defence against this would be to make yourself a brand name for quality data crunching journalism, like that bloke, what’s his name? Yes, that one, John Burn-Murdoch ;)