I was recently doing some development, using LM Studio to run my local models. I hit a ‘strange’ issue with one model: it was no better or worse than my standard go-to, Llama 3.2 Instruct, it was just very different in the output it was giving, interpreting prompts in a way unlike any other model I had tried.
I looked at the prompt structure in LM Studio and, to my surprise, found many Chinese characters there! So I checked the Hugging Face model card, which was also predominantly in Chinese, and yes, I confirmed the model was Chinese in origin.
The above led me to investigate LLMs from China more generally.
I was surprised to learn that close to 50% of open-source (free) LLMs originate from Chinese sources, see here. I don’t have 100% confidence in what I read on Reddit! So I investigated further and confirmed that what was being shown was probably true.
My mindset, like most people’s, is that I don’t generally trust things such as emails from places like China, Russia, etc. Rightly or wrongly, I became suspicious. My grandmother would often tell me, ‘nothing in life is free, and if someone is offering you something for free, then they will have a reason for doing so’.
By now I had drifted a long way from my original reason for checking out this model, but I was intrigued by the situation.
Think of the basics of how an LLM functions: it calculates the probability of what the next token will be. If we expand on this and, for simplicity, rephrase token as word, then we can say that the probability of each next word builds up a sentence, and so on.
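To make that concrete, here is a minimal sketch of next-token prediction. The vocabulary and the scores (logits) are entirely made up for illustration; a real model computes scores over tens of thousands of tokens, but the softmax step that turns scores into probabilities works the same way.

```python
import math

# Toy vocabulary and made-up model scores ("logits") for the next
# word, given some prompt. A real model computes these scores.
vocab = ["Paris", "London", "a", "the"]
logits = [4.0, 1.0, 0.5, 0.2]

def softmax(scores):
    """Convert raw scores into a probability distribution."""
    exps = [math.exp(s) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

probs = softmax(logits)
for word, p in zip(vocab, probs):
    print(f"{word}: {p:.3f}")
# The word with the highest score ("Paris") gets by far the
# highest probability, so it is the most likely next token.
```

Generation simply repeats this step: sample (or pick) a token from the distribution, append it to the prompt, and score again.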
What if, while building these free open-source LLMs, subtle changes are made to steer the outputs in a certain direction? You get my drift?
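The point is easy to demonstrate in miniature. Below, a three-word toy model starts out with no preference at all, and a small invented bias (the +0.5/−0.5 numbers are mine, purely for illustration) is added to the scores. In a real model such a nudge would come from the training data or fine-tuning, not a hand-written list, but the effect on the probabilities is the same kind of thing.

```python
import math

def softmax(scores):
    """Convert raw scores into a probability distribution."""
    exps = [math.exp(s) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# An unbiased toy model: every option equally likely.
vocab = ["good", "bad", "neutral"]
base_logits = [1.0, 1.0, 1.0]

# A hypothetical hidden nudge baked in during training:
# slightly favour "good", slightly suppress "bad".
bias = [0.5, -0.5, 0.0]
steered = [l + b for l, b in zip(base_logits, bias)]

base_probs = softmax(base_logits)
steered_probs = softmax(steered)
print(base_probs)     # all three options equally probable
print(steered_probs)  # "good" is now clearly favoured over "bad"
```

A shift this small is invisible in any single response, yet over millions of generated sentences it would consistently tilt the tone of the output.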
If this is so, then I’m sure that all nations, not just China, could see potential in manipulating opinion, learning and anything else that may be to their benefit. Remember, the USA is second in the list of top provider nations of free open-source LLMs. Living in the UK, I’m confident that ‘if’ such a thing is possible then even my own government would see the potential; this is not a China thing.
Given how much of our lives are becoming dependent on AI and LLMs, is the above something that should be considered for all LLMs?
While there may be little to no evidence at present, we must ask to what extent we are becoming reliant on LLMs. If a huge share of the market is being soaked up by free open-source LLMs, could that reliance, in time, be manipulated?
Am I being paranoid, or is this something that’s worth considering?