I tried the 27B QAT model and it hallucinates like crazy. When I ask it for information about some made up person, restaurant, place name, etc., it never says "I don't know about that" and instead seems eager to just make up details.
The larger local models like the older Llama 3.3 70B seem better at this, but are also too big to fit on a 24GB GPU.
Working on adding features to my app.
I wrote BVCalc because I wanted a calculator that would show not only the numerical result value but also show the corresponding algebraic expression that I entered so that I could check for entry mistakes and more easily keep track of calculations. I also wanted this feature to work even when the calculator is in RPN mode. That is, I wanted to be able to use RPN entry, and have the calculator show both the result value and the corresponding algebraic expression (displayed using infix notation, not RPN) for each item on the stack.
I was surprised that such a feature was not available on any existing calculators and so I wrote my own. Runs on macOS, iPhone, and iPad.
I use BVCalc on iPhone/iPad/macOS since it has the best RPN mode of any calculator I have tried. It's the only calculator that shows you the algebraic expression (i.e., infix mode) next to the corresponding result value for each stack item, and it does this recursively as you perform operations on the stack! This helps you check for entry mistakes and makes it easier keep track of what each stack item represents.
I still use an old HP48GX occasionally as well as well.
I know that higher bandwidth is good for LLMs but are there actual workstation use cases where the speed outweighs the context and parameter limits imposed by 16GB of total system ram? I don't follow it that closely but it just feels to me like quality >>> quantity for laptops or mini PCs where an APU would be relevant.
RealCalc Plus is great on the Android side.
If using iPhone/iPad/macOS, try BVCalc. Its RPN mode shows you the algebraic expression (i.e., using infix notation display) for each item on the stack, which both helps you check for entry mistakes and also more easily keep track of what each stack item represents. I haven't found another RPN calculator that can do this.