AI is Going to Replace Most Work
Post created: Fri, 11 Apr 2025 00:42:00 +0800Clickbait (not really)
Background
Here are just my thoughts on AI (in particular Large Language Models, since it is currently the most impactful AI system for normal people). I do write them in point-form, so donāt expect coherence here :(
Requirements for AI Replacement
- A lot of people did the same/similar work previously
- Examples were readily available online
Statement: No matter how powerful LLMs are (reasoning, agents, long context retrieval, etc.), they still require previously seen knowledge and cannot solely rely on their āhallucinationā.
Hypothesis: AI with significant reasoning ability cannot complete a task flawlessly if it is completely new to either humans or AI systems in general.
Result
AI is able to train on and remember these examples, letting anyone do the same thing with minimal prior knowledge.
Solution
Go to fields that have minimal examples:
- Fields with few practitioners
- Relatively unused fields like certain historical niches
- (Though I likely wonāt study these fields)
- Useful fields that require significant knowledge and research (e.g., SOTA)
- Relatively unused fields like certain historical niches
- Fields with no patterns
- Fields that are purely random in nature
- (I likely wonāt succeed in these fields due to their indeterministic nature)
- Fields that are purely random in nature
- Fields that constantly generate new examples
- Faster than what AI can learn
- Question: As AI scales up every year, what fields can really keep pace?
What AI have I been using recently?
ChatGPT 4o (on chatgpt.com)
- Image generation ability (as of Apr 11, 2025)
- Native generation is really impressive and is able to form correct, clean text such as banners or blackboards, etc. However, there are still some issues with the geometric understanding.
Gemini 2.5 Pro Preview (on aistudio.google.com)
- Really impressive long context retrieval and reasoning ability
- It is able to perform calculation step by step (not skipping steps unlike other AI models when they receive 10 math questions and told to do all at once). It is able to make best informed decision according to the context provided. TL;DR, it is attentive to details (Yes K.P. Wat!)
- Unlimited usage and free, canāt demand more, right?
Gemini DeepResearch (on gemini.google.com)
- This is really impressive. I used it for several purposes:
- Finding scholarly published papers and using them as my APA citations for my essay assignments and other stuff
- Performing in-depth broad range analysis and investigation on one particular field or topic
- It is able to consider almost all aspects of the topic (primarily due to the mass amount of information it receives on the internet, and its reasoning ability) and generate a detailed report. If my prompt is detailed, then I would expect more than 10 pages, 5000+ words report (including references).
- I almost built a workflow for this:
- (This should be step 0) Come up with an interesting topic that I want to read about and spend time on it
- Ask Gemini 2.5 Pro, or other AI models to understand what I donāt know, and to provide me with a detailed prompt which covers various aspects of the topic to be explored on
- Throw the prompt into Gemini DeepResearch
- In the meantime, generate a $\LaTeX{}$ template using either Gemini 2.5 Pro or Claude 3.7 Sonnet (you may ask why I use this? It is because I bought Claude Pro, so not using it feels like wasting money š )
- Wait until the Gemini DeepResearch finishes
- Throw the entire report into ChatGPT 4o and let it generate a cover page image (using its native image generation ability)
- Ask any of the LLM models mentioned above to amend the $\LaTeX{}$ template in order to adjust the section, subsection and other stuff that the DeepResearch-generated report gave us
- Copy respective sections from the report and paste them into the $\LaTeX{}$ template
- Fix the errors that arise, this step should take around 5-10 trials depending on luck
- $\cdots$
- Done!
- In fact, you can see one of my previously generated report samples here
Claude 3.7 Sonnet (thinking) (on claude.ai via my Claude Pro subscription)
- Before the GPT-4o image generation ability was released, I thought this was the best decision (of subscribing to an AI service) because Claude was the strongest LLM model at (web development specifically) programming and I have a web development course (COMP3322)
- It turns out GPT-4o native image output released few days after
- Gemini 2.5 Pro followed next
- So it was kind of not a good time to buy a subscription lol
- UI is generally good, with support of React artifacts (I primarily use it for data visualization since I donāt need to run it on my python lol)
- In terms of the attention to details ability, it is not as good as Gemini 2.5 Pro (which is able to perform step by step calculation). Claude 3.7 Sonnet (thinking) would just summarize everything if I didnāt explicitly ask it to explore every step in-depth