It’s hard as fuck to use LLMs for financial research. I did it anyways

The challenge in converting English into LLM Functions

In partnership with

`

Long Angle: A Vetted Community for High-Net-Worth Entrepreneurs and Executives

  • Private, vetted community offering confidential discussions and education

  • Entrepreneurs and executives, 30–55 years old, with $5M to $100M net worth

  • Preferential access to top-tier alternative investments

Exciting News: Paid Subscriptions Have Launched! 🚀

On September 1, we officially rolled out our new paid subscription plans at GuruFinance Insights, offering you the chance to take your investing journey to the next level! Whether you're just starting or are a seasoned trader, these plans are packed with exclusive trading strategies, in-depth research paper analysis, ad-free content, monthly AMAsessions, coding tutorials for automating trading strategies, and much more.

Our three tailored plans—Starter Investor, Pro Trader, and Elite Investor—provide a range of valuable tools and personalized support to suit different needs and goals. Don’t miss this opportunity to get real-time trade alerts, access to masterclasses, one-on-one strategy consultations, and be part of our private community group. Click here to explore the plans and see how becoming a premium member can elevate your investment strategy!

An LLM answering which stocks are similar to TSLA

If I asked you, which stocks are most similar to Tesla, what would you say?

One investor might start listing off other automobile stocks. They might say stocks like Ford and Toyota because they also have electric vehicles.

Another investor might think solely about battery technology and robotics.

And yet, a third might just look at technology stocks with a similar market cap.

This is an inherent problem with language. Programming language languages don’t have this issue because you have to be extremely precise with what you actually want.

And because of this language barrier, it is extremely hard to effectively use large language models for financial research.

And yet, I did it anyways.

AI could turn $10,000 into $1M in the next few years? – here’s how

Artificial Intelligence is being called the next $100 trillion industry—and early investors could have a rare chance to profit.

Renowned investor, James Altucher, believes AI will be the biggest financial opportunity of the decade. He has a history of making accurate predictions about emerging technologies, and his latest focus is on AI.

Altucher claims that a $10,000 investment in the right AI stocks could turn into $1 million or more in the coming years.

But this opportunity won’t last forever. He predicts that the window to get in on these AI investments will end very soon— and in this video explains how everyday investors can take advantage of this “wealth window” before it’s too late.

Plus, he’s also revealing one of his top AI stock picks for free. Don’t miss this chance to get in early on what could be the biggest tech revolution yet.

The Problem With Using Traditional Language Models for Financial Research

Naively, you might think that ChatGPT alone (without any augmentations) is a perfectly suitable tool for financial research.

You would be wrong.

ChatGPT’s training data is very much out of date

While ChatGPT can answer basic questions, such as “what does ETF mean?”, it’s unable to provide accurate, current, data-driven, factual answers to complex financial questions. For example, try asking ChatGPT any of the following questions.

  1. What AI stocks have the highest market cap?

  2. What EV stocks have the highest free cash flow?

  3. What stocks are most similar to Tesla (including fundamentally)?

Because of how language models work, it is basically guessing the answer from its latest training. This is extremely prone to hallucinations, and there are a myriad of questions that it simple won’t know the answer to.

This is not to mention that its unable to simulate different investing strategies. While the ChatGPT UI might be able to generate a simple Python script based for a handful of technical indicators, it isn’t built for complicated or real-time deployment of trading strategies.

That is where a specialized LLM tool comes in handy.

Distilling real-time financial knowledge into an LLM: Function-calling

Specialized language model tools have are better than general models like ChatGPT because they are better able to interact with the real-world and obtain real-time financial data.

This is done using function-calling.

Function-calling is a technique where instead of asking LLMs to answer questions such as “What AI stocks have the highest market cap?”, we instead ask the LLMs to interact with external data sources.

This can mean having LLMs generate JSON objects and call external APIs or having the LLMs generate SQL queries to be executed against a database.

How function calling works for SQL queries

After interacting with the external data source, we obtain real-time, accurate data about the market. With this data, the model is better able to answer financial questions such as “What AI stocks have the highest market cap?

An accurate, up-to-date answer on which AI stocks have the highest market cap

Compare this to the ChatGPT answer above:

  • ChatGPT didn’t know the current market cap of stocks like NVIDIA and Apple, being wildly inaccurate from its last training session

  • Similarly, ChatGPT’s responses were not ordered accurately based on market cap

  • ChatGPT regurgitated answers based on its training set, which may be fine for AI stocks, but would be wildly inaccurate for more niche industries like biotechnology and 3D Printing

Moreover, specialized tools have other unique features that allow you to extract value. For example, you can turn these insights into actionable investing strategies.

By doing this, you run simulations of how these stocks performed in the past – a process called backtesting. This informs you of a set of rules would’ve performed if you executed them in the past.

Changing your insights into testable investing strategies using natural language

Yet, even with function-calling, there is still an inherent problem with using Large Language Models for financial research.

That problem is language itself.

The Challenges with Using Language for Financial Research

The problem with using natural language for financial research is that language is inherently ambiguous.

Structured languages like SQL and programming languages like Python are precise. It’s going to do exactly what you tell it to do.

However, human languages like English aren’t. Different people may have different ways for interpreting a single question.

The list of stocks similar to NVIDIA according to this AI

For example, let’s say we asked the question:

What stocks are similar to NVIDIA?

  • One investor might look at semiconductor stocks with a similar financial health sheet in 2023

  • Another investor might look at AI stocks that are growing in revenue and income as fast as NVIDIA

  • Yet another investor might look at NVIDIA’s nearest competitors, using news websites or forums

That’s the inherent problem with language.

It’s imprecise. And when we use language models, we have to transform this ambiguity into a concrete input to gather the data. As a result, different language models might have different outputs for the same exact inputs.

But there are ways of solving this challenge, both as the developer of LLM apps and as an end-user.

  1. As a user, be precise: when using LLM applications, be as precise as you can. Instead of saying “what stocks are similar to NVIDIA?”, you can say “which stocks are similar to NVIDIA in industry and have a 2021, 2022, and 2023 fundamental stock ranking within 1 point of NVIDIA?”

  2. As a developer, be transparent: whenever you can, have the language model state any assumptions that it made, and give users the freedom to change those assumptions

  3. As a person, be aware: simple being aware of these inherent flaws with language will allow you to wield these tools with better precision and control

By combining these strategies, you’ll unlock more from LLM-driven insights than the average investor. Language models aren’t a silver bullet for investing, but when used properly, can allow you to perform research faster, with more depth, and with better strategies than without them.

Concluding Thoughts

Nobody ever talks about the pitfalls of language itself when it comes to developing LLM applications.

Natural language is imprecise, and leaves room for creativity. In contrast, structured languages like SQL and programming languages like Python are exact — they will always return the same exact output for a given input.

Nevertheless, I’ve managed to solve these problems. For one, I’ve given language models access to up-to-date financial data that makes working with them more accurate and precise.

Moreover, I’ve learned how to transform ambiguous user inquiries into concrete actions to be executed by the model.

But, I can’t do everything on my own. Language itself is imperfect, which is why its your responsibility to understand these pitfalls and actively work to mitigate them when interaction with these language models.

And when you do, your portfolio’s performance will be nothing short of astronomical.