More

zerop · 2025-11-10T17:04:38 1762794278

Microservices created the need for more teams, devs, coordination, meetings, scrum teams, scrum master, Jira, CI CD tools, cloud market and so on.. It is very well planned :)

9rx · 2025-11-10T17:27:37 1762795657

It was — until developers started shouting "You don't need microservices. Just build a monolith."

Then, uncoincidentally, they started crying about how they couldn't find work anymore.

drob518 · 2025-11-10T17:38:55 1762796335

There are a lot of people who shouldn’t be doing software engineering.

9rx · 2025-11-10T17:44:05 1762796645

2015:

- "I'm struggling to find work and need to make money. What can I do?"

- "Learn to code, good buddy! Software engineering solves all problems."

2025:

- "I learned to code and am still struggling to find work and need to make money."

- "You shouldn't be doing software engineering."

drob518 · 2025-11-10T17:56:11 1762797371

Yep, exactly.

9rx · 2025-11-10T17:56:43 1762797403

Exactly.

zerop · 2025-03-27T17:56:55 1743098215

The explanation of "hallucination" is quite simplified, I am sure there is more there.

If there is one problem I have to pick to to trace in LLMs, I would pick hallucination. More tracing of "how much" or "why" model hallucinated can lead to correct this problem. Given the explanation in this post about hallucination, I think degree of hallucination can be given as part of response to the user?

I am facing this in RAG use case quite - How do I know model is giving right answer or Hallucinating from my RAG sources?

kittikitti · 2025-03-27T18:17:06 1743099426

I incredibly regret the term "hallucination" when the confusion matrix exists. There's much more nuance when discussing false positives or false negatives. It also opens discussions on how neural networks are trained, with this concept being crucial in loss functions like categorical cross entropy. In addition, the confusion matrix is how professionals like doctors assess their own performance which "hallucination" would be silly to use. I would go as far to say that it's misleading, or a false positive, to call them hallucinations.

If your AI recalls the RAG incorrectly, it's a false positives. If your AI doesn't find the data from the RAG or believes it doesn't exist it's a false negative. Using a term like "hallucination" has no scientific merit.

esafak · 2025-03-27T19:51:19 1743105079

So you never report or pay heed to the overall accuracy?

miraculixx · 2025-03-31T20:18:51 1743452331

"Hallucination" is just to term we use to say "this result is not what it should be". The model always uses the very same process, it does not do one thing for "hallucinations" and something else for "correct" results.

In a nutshell it is always predicting the next token from a joint probability distribution. That's it.

All other interpretations are speculative.

pcrh · 2025-03-27T21:59:43 1743112783

The use of the term "hallucination" for LLMs is very deceptive, as it implies that there is a "mind".

In ordinary terms, "hallucinations" by a machine would simply be described as the machine being useless, or not fit for purpose.

For example, if a simple calculator (or even a person) returned the value "5" for 2+2= , you wouldn't describe it as "hallucinating" the answer....

astrange · 2025-03-28T21:22:52 1743196972

"Hallucination" happened because we got AI images before AI text, but "confabulation" is a better term.

zerop · 2025-03-18T15:40:59 1742312459

Currently exploring cube for a "natural language to SQL" solution.

My schema is - 90+ Tables, 2500+ Columns, well documented

From your experience, does Cube look a fit? My use cases will definitely have JOINS.

mritchie712 · 2025-03-18T18:12:19 1742321539

yes, that shouldn't be a problem.

with that many tables, you might want to use Views: https://cube.dev/docs/reference/data-model/view

zerop · 2025-03-18T18:24:08 1742322248

Thanks. sorry, asking more question - Do we need human in the loop with Cube to define the views for all kinds of queries.

In my use case, it's going to be exposed to various kind of stakeholders and there will be versatility of user queries. I can't pre-create views/aggregations for all scenarios.

zerop · on Jan 1, 2025

How people are unearthing knowledge from ancient libraries using Claude

https://x.com/PrintedPathways/status/1865637416637231406

zerop · on Dec 26, 2024

This looks great, thanks for building this.

Something on similar lines which many may link, Research Rabbit - https://www.researchrabbit.ai/

Quizzical4230 · on Dec 26, 2024

I am glad you liked it!

I wanted PaperMatch to be open-source so that the users can understand the workflow behind it and hack it to their advantage instead of grumbling away when the results aren't to their liking.

zerop · on Dec 18, 2024

Can we give reference of these articles to LLMs and get them to write articles like this for educational contents and produce similar WebGL graphics code to render images. I mean, just use this style and produce educational content using AI. that might make the studies more interesting.

The5thElephant · on Dec 18, 2024

I'm guessing it will get nowhere close to as well considered, written, and structured as what Bartosz makes himself.

I don't know how people don't see how poor quality so much AI writing is, even when referencing good quality work.

Also making effective visualizations that do a good job of illustrating a concept is not just a matter of being able to write the code.

zerop · on Dec 18, 2024

Can we give reference of these articles to LLMs and get them to write articles like this for educational contents and produce similar WebGL graphics code to render images. I mean, just use this style and produce educational content using AI. that might make the studies more interesting.

zerop · on Dec 16, 2024

I fear that in the goal of going from "manual coding" to "fully automated coding", we might end up in the middle, where we are "semi manual coding" assisted by AI, which would need different software engineer skill.

zerop · on Dec 15, 2024

Is it possible to build similar to anthropic computer use feature with Qwen vision model.

Someone open sourced it with langchain

https://x.com/1littlecoder/status/1856397375704576399

leobg · on Dec 15, 2024

Browser use is very easy. Can even do that headless. That way, you can also do bulk processing. For a client, I did some 16k websites with a simple LLM agent. With “computer use” how long would that take, and what would it cost? For me, it was ~$20 (I used Gemini for this task).

zerop · on Dec 13, 2024

Great, congrats on your launch.

1. Does it take care of Bot detection. Most sites will have it.

2. Is this something similar to Firecrawl - https://www.firecrawl.dev/

sachou · on Dec 13, 2024

Yes, it has an extensive proxy IP and retry system in place to bypass bot detection.

I’m also trying to gather more feedback to identify the killer feature:

- Adding vectorization to Pinecone out of the box? - Adding multiple integrations like n8n, etc.?

Any crucial pain points to avoid?

gregoryl · on Dec 13, 2024

Are you concerned about making a product that does this? The legal aspect of accessing a computer system that is intending to block your use seems worrisome.

sachou · on Dec 13, 2024

It is the responsibility of the user. Everyone should be responsible for their own actions. We still allow knives to be sold, and most people use them for good.

carpenecopinum · on Dec 13, 2024

Now imagine that knife stabbings became so common that almost everyone started wearing body armor and you start selling body armor defeating knives explicitly. I can honestly see why most people would be upset about that.

gregoryl · on Dec 13, 2024

I don't see that as a good analogy. There's very limited space for this functionality to be used legitimately / legally - anyone permitted to scrape content is likely able to access the data without the protection measures in the way.

I'm fairly sure circumvention is a (prosecuted!) crime in several countries - curious if you're across that angle, and/or have legal advice/direction you can share?