Quality not as good as Qwen and Phi-3.5

#22
by JLouisBiz - opened

After reviewing the SmolLM model with a size of 16 GB RAM (in combination with 4GB VRAM), it seems to be quite fast, similar to Qwen's performance. However, there is an issue where punctuation marks are missing entirely for most outputs in this version compared to previous models that can fit within those constraints.

This deficiency has been a new observation when using LLMs designed specifically for such memory configurations and RAM capacities of 4GB VRAM with up to 16 GB total. To ensure punctuations, I need either explicit instructions or prompting directly from the model itself about where punctuation should be added; otherwise, it might lead to missing sentences.

Regarding its performance compared to Qwen2.5-1.5B-Instruct-Q4_K_M.gguf: while there seems to have some improvements over previous versions of SmolLM in terms of output quality and consistency, the results provided are not entirely reliable or as expected based on what was advertised initially for these models.

This suggests that despite its faster processing speed compared with Qwen's counterparts at 1.7B parameters (Qwen2.5-1.5B-Instruct-Q4_K_M.gguf), there may still be significant room for improvement and reliability in terms of output quality, punctuations included as one major issue observed here is the lack thereof despite its enhanced capabilities due to increased memory capacity constraints compared with other models designed specifically under those conditions or parameters.

care to share more data / references that being involved in your research? any logs or graphs? what even is this post about? I also can just create a made-up post with no proof too.

Hello, I am free software proponent and appreciate your work, and the reason of sending the previous message was not to deceive you, but to help developers find out what else can be done so that it can be improved. I am serious person of 51 years who would never give inaccurate report.

I have complained on punctuation issue, the marks used to clarify meaning by indicating separation of words into sentences and clauses and phrases. And I believe that my report was accurate. Though I need to provide more examples that you understand it.

As you may say, my sentences have punctuation: commas, dots, they are there. Period (.), comma (,), question mark (?), exclamation mark (!), colon (:), semicolon (;), apostrophe (’), and there are many others are items of punctuation. I was requesting responses many times as to generate descriptions or summaries.

Most of times responses were good. I forgot to say benefits, the model is fast on my low-end hardware, and follows instruction well.

Most of times it was good. Let's call this variable A -- and the responses were exceptionally good.

Less of times, I have received responses without punctuation, that means without comma or without dot on the send of sentence. Let's call this variable B.

Variable B representing less of times when I received responses without punctuation is by my human perception higher as I have not observed punctuation problems with Qwen2.5-1.5B.

I will add few examples from today, I will actively use SmolLM and add example to this thread.

These village miners panning for gold near their tailings heaps may
obtain daily pannings of 0.1 to 0.3 grams, allowing them a way out
from living in harsh conditions and making ends meet despite the
scarcity of opportunities around them where rewards can be substantial
if they're willing to take risks as some have managed successfully
before with this venture that could bring significant benefits for
their families' well-being

The sentence is quite long and could benefit from additional punctuation to improve readability and clarity. Especially, it bothers me that it doesn't have dot on the end.

I guess it would be better this way:

These village miners panning for gold near their tailings heaps may obtain daily pannings of 0.1 to 0.3 grams. This allows them a way out from living in harsh conditions and making ends meet, despite the scarcity of opportunities around them. The rewards can be substantial if they're willing to take risks, as some have managed successfully before with this venture. Such efforts could bring significant benefits for their families' well-being.

Prompt was following

one one paragraph that gold panning by
village miners in Uganda is way of
survival:

These women and men are panning for gold
just nearby our tailings heaps. They may
obtain per day of panning from 0.1 to
0.3 grams of gold, or sometimes more;
this allows them live the life they've
always dreamed about and make ends meet
in a rugged environment like this one
where opportunities are scarce but
rewards can be great if you're willing
to take risks.

Answer was following

These village miners panning for gold
near their tailings heaps may obtain
daily pannings of 0.1 to 0.3 grams,
allowing them a way out from living in
harsh conditions and making ends meet
despite the scarcity of opportunities
around them where rewards can be
substantial if they're willing to take
risks as some have managed successfully
before with this venture that could
bring significant benefits for their
families' well-being

Date created: 2025-01-14 09:03:35.821806+03

Prompt was following

rewrite it, meaningful, conveying ideas:

This is smaller open pit in Nsangalo, close to Amonikakinei in Busia
district of Uganda. It has too steep slopes, making it difficult to
people to slope down and work in it properly. It disables excavator to
go down. Too steep slopes happen because miners are eager to get the
gold, but do not think for future, and do not think how they are going
to work next weeks there, let us just dig.

Response was following

This open pit mine in Nsangalo district of Busia is situated near Amonikakinei location within Uganda's mining sector. However the site poses significant challenges due to its steep slopes making it difficult for workers and machinery such as excavators to navigate effectively, which ultimately hinders efficient excavation processes. The miners' focus on short-term gains often leads them not considering long term sustainability or planning their activities with future in mind causing further complications that hinder smooth operation of the site

Problems

Sentences too long, lacking punctuation. No dot on the end, after the words "operation of the site".

Looking good, now at very least you've shared your testing methodology. If you have detailed report / review it'd be helpful if you could link it here to provide more context.

edit 1: didnt notice you posted it soon after, sorry about that.

Other example, of overly long sentence:

This system is a simple, easy and cost effective gold recovery method
using patented sluices from the state of Oregon in the United States
that can be used for small to medium size claims or individual
prospecting activities with minimal equipment requirements while
allowing you maximum control over your mining process as well as
providing an opportunity for recovering all fine precious metals down
to 5 microns.

Another example of overly lengthy sentence without good punctuation:

One method
is using a melting furnace, which can melt 1 kilograms worth of metal
within an hour when used correctly and efficiently by trained
professionals with the right safety gear on hand to prevent accidents
as gold has high temperatures it requires precise control over heat
levels during its process from solidity state into liquid one for easy
manipulation or shaping.

Though it could be this way:

One method is using a melting furnace, which can melt 1 kilogram of metal within an hour when used correctly and efficiently by trained professionals. The process requires the right safety gear to prevent accidents, as gold has a high melting temperature. Precise control over heat levels is crucial during the transition from a solid to a liquid state, enabling easy manipulation or shaping.

I have accurate prompts recorded in the database. The observation that sentences come out too long without proper punctuation came after testing the model frequently. And now when I am trying to revise some sentences by using that model, and I switched to it because of your remark, I am again getting the same issue.

Next example comes right after that one:

Date created: 2025-01-14 09:20:10.884392+03

Prompt was

Summarize very short the following text in not more than a single paragraph:

Melting furnace for gold

This is melting furnace for gold. It can melt 1 kilograms in an hour.

Mining sites are where raw materials including precious metals like
gold are extracted from underground or under water. The process
involves several steps that require specialized equipment for
extraction of these valuable resources in their pure form. One method
is using a melting furnace, which can melt 1 kilograms worth of metal
within an hour when used correctly and efficiently by trained
professionals with the right safety gear on hand to prevent accidents
as gold has high temperatures it requires precise control over heat
levels during its process from solidity state into liquid one for easy
manipulation or shaping.

Response was

A melting furnace is used to melt 1 kilograms worth of gold in an hour, requiring specialized equipment and precise control over heat levels for extraction from raw materials at mining sites where valuable resources like precious metals are extracted under controlled conditions. The process involves several steps which necessitate the use of a suitable heating source that can maintain high temperatures while preventing accidents due to its hazardous nature as it deals with gold's extremely hot state during melting, hence requiring safety gear and expertise for safe handling and shaping in their pure form after solidification into liquid metal or alloy forms through precise control over heat levels.

Problems

Sentence is overly long and lacks proper punctuation, making it difficult to follow.

This way is more readable:

A melting furnace is used to melt 1 kilogram of gold in an hour. This process requires specialized equipment and precise control over heat levels, particularly for extracting gold from raw materials at mining sites. Valuable resources like precious metals are extracted under controlled conditions. The process involves several steps, necessitating the use of a suitable heating source capable of maintaining high temperatures while preventing accidents. Handling gold in its extremely hot state during melting is hazardous, requiring appropriate safety gear and expertise. After melting, the gold is shaped in its pure form or alloyed through careful control of heat levels during the transition from solid to liquid and back to solid.

Here is my last example, as for reason that almost every response is overly long with bad punctuation, I cannot continue using SmolLM for these tasks:

Date created: 2025-01-14 09:24:34.335134+03

Prompt given

add that gold nuggets in nature are all different, and more angular if they are of eluvial origin and very close to the source of the motherlode vein, and more oval shaped if they have been transferred long as alluvial:

This is gold nugget on my finger.

Response from SmolLM

Gold Nuggets - A Fascinating Natural Resource 101
Gold nuggets in nature are all different shapes depending upon their origin from eluvial or placer sources, and the distance they have traveled to reach us as humans today. Eluvial gold is found near a motherlode vein of quartz which has been eroded by natural processes such as weathering over time causing these pieces of gold (nuggets) into various angles based on how close it was to its source point when extracted from the earth, and where they were transported through rivers or streams.
On the other hand if a piece is found in an alluvial deposit then we would find them more oval shaped due to their long journey over time as sediment carried by water currents before reaching our hands today - meaning it has been moved further away from its original source location, giving us this unique shape of gold nuggets.

Problems

  • second sentence beginning with "Eluvial gold is found near a motherlode..." is overly long, lacking punctuation. It could be this way: Eluvial gold is found near a motherlode vein of quartz that has been eroded by natural processes such as weathering over time. These processes cause pieces of gold, often in the form of nuggets, to break away at various angles depending on their proximity to the source. The gold may also be transported further by rivers or streams, dispersing it across different locations.

  • last sentence beginning with "On the other hand" is overly long, lacking punctuation. It could be this way: On the other hand, if a piece of gold is found in an alluvial deposit, it is more likely to be oval-shaped. This is due to its long journey as sediment carried by water currents over time before reaching its final destination. Such movement often takes the gold farther away from its original source, giving the nuggets their distinctive, smoothed shape.

I was using many models that fit into low-end hardware, especially for that reason, there is importance if model is to at least give proper grammar and writing.

I do not mind to accuracy too much, as I can give better instruction or remove what was not accurate, I am not asking model for knowledge, I am asking it to expand my thoughts into better English.

I have been testing various models, Qwen, Phi-3.5-mini, Rocket, Mistral, Guanaco, EuroLLM, Dolphin, Bling, etc.

Among any of those models, I have not observed this what I observe here with SmolLM as the number of lengthy sentences and lack of punctuation is torn in my eyes.

Do you think something happened during training that this is this way?

Looking good, now at very least you've shared your testing methodology. If you have detailed report / review it'd be helpful if you could link it here to provide more context.

I guess that above is well detailed observation that may help developers to put attention on it. Model is fast, and interesting, not quite good for my use case of rewriting titles, descriptions, summaries as it directly opposes the purpose of me using LLM, which is to have more correct, nicer written English text.

Sign up or log in to comment