The best way I have found is to set up keepalived -> pgbouncer -> Postgres. Use repmgr to manage replication and barman for backups. Set up a VIP with keepalived, plus a small script that checks whether the server is the primary (see the sketch below). With keepalived checking about every 2 seconds and flipping after 3 consecutive failures, you lose about 7-9 pings during a failover.
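A minimal sketch of such a check script (connection details are assumptions; keepalived would run it from a vrrp_script block with interval 2 and fall 3 to match the timings above):

```python
#!/usr/bin/env python3
# keepalived health check: exit 0 only if the local Postgres is up
# and reports it is NOT in recovery (i.e. it is the primary).
# User/host details are assumptions; adjust for your environment.
import subprocess
import sys

try:
    result = subprocess.run(
        ["psql", "-U", "postgres", "-h", "127.0.0.1", "-tAc",
         "SELECT pg_is_in_recovery()"],
        capture_output=True, text=True, timeout=2,
    )
    # pg_is_in_recovery() prints 'f' on the primary, 't' on a standby.
    is_primary = result.returncode == 0 and result.stdout.strip() == "f"
    sys.exit(0 if is_primary else 1)
except Exception:
    sys.exit(1)
```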
Thank you! A blame-focused culture rewards the least risk-taking, the most ass-covering, and piles of useless bureaucracy, because you naturally accumulate systems that convert individual blame into collective blame, like change review boards and multiple sign-offs for everything. Folks do the bare minimum because that's the safe subset.
I'm never going back to that kind of culture, it's soul crushing.
This episode of Kris Jenkins' Developer Voices podcast talks with a couple authors of a new book on DuckDB, and does a great job of explaining the sorts of things that make it so unusual: https://www.youtube.com/watch?v=_nA3uDx1rlg
I really need to dig into the more recent advances in knowledge graphs + LLMs. I've been out of the game for ~10 months now, and am just starting to dig back into things and get my training pipeline working (darn bitrot...)
I also trained it on generating KGs from conversations, or articles you have provided. So from the LLM side, it's way more knowledgeable about the connections in the graph than GPT4 is by default.
Here are a couple examples of the trained model actually generating a knowledge graph:
I haven't done any work on integrating those into larger structures, combining the graphs generated from different documents, or using a graph database to augment my use case...all things I am eager to try out, and I am glad there is a bunch more to read on the topic available now.
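In case it's useful to anyone poking at the same things, here's a generic sketch of merging per-document graphs; the pipe-delimited triple format is my own illustration, not Inkbot's actual output:

```python
# Generic sketch: parse "subject | relation | object" lines from LLM
# output and merge per-document graphs. Format and data are illustrative.
import networkx as nx

def parse_triples(text):
    g = nx.DiGraph()
    for line in text.strip().splitlines():
        subj, rel, obj = (part.strip() for part in line.split("|"))
        g.add_edge(subj, obj, relation=rel)
    return g

doc1 = "DuckDB | is_a | analytical database\nDuckDB | runs_in | host process"
doc2 = "DuckDB | supports | SQL\nSQL | is_a | query language"

# compose() unions nodes and edges, so shared entities like "DuckDB"
# from different documents line up in the merged graph.
merged = nx.compose(parse_triples(doc1), parse_triples(doc2))
print(merged.number_of_nodes(), merged.number_of_edges())
```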
Anyways, near-term plans are to train a llama3 8b, and likely a phi-3 13b, version of Inkbot on an improved version of my dataset. Glad to see others as excited as I am about this topic!
While there is no silver bullet, for beginners I found that "Relieving your Python packaging pain" (https://www.bitecode.dev/p/relieving-your-python-packaging-p...) is the Pareto solution. That is, the solution with the best ratio of effort to reward, and also the lowest risk of failure. It's not zero risk, but I've been helping beginners with Python for 15 years and have tried everything you can think of.
It does mean that, on Linux, you limit yourself to the Python versions you can install. For most people this constraint is preferable to the alternative, even if it frustrates our geeky souls.
Every comment in that thread, including the comment you referenced, is wrong.
Due to Poetry's architecture, it can't satisfy all three at the same time:
- the platonic ideal of build isolation and lockfiles
- installing the appropriate accelerator-specific version of pytorch for the current platform non-interactively, or one that the user selects
- pytorch dependencies that are transitively compatible with the way other packages in the ecosystem express their pytorch dependencies
This is something you can achieve with setuptools and setup.py by forfeiting the platonic ideal of build isolation and lockfiles (sketched below).
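A minimal sketch of that setup.py approach, with the torch selection done at install time; the versions and wheel URL are illustrative placeholders, not a tested configuration:

```python
# setup.py - dynamic, accelerator-specific torch selection at install
# time. This is exactly the kind of logic a static pyproject.toml (and
# Poetry's resolver) cannot express. The URL below is a placeholder.
import platform
from setuptools import setup

if platform.system() == "Darwin":
    torch_req = "torch==2.2.2"  # CPU/MPS wheel from PyPI
else:
    # Direct reference to a CUDA-specific wheel (illustrative).
    torch_req = (
        "torch @ https://download.pytorch.org/whl/cu121/"
        "torch-2.2.2%2Bcu121-cp311-cp311-linux_x86_64.whl"
    )

setup(
    name="example-ml-project",
    version="0.1.0",
    install_requires=[torch_req],
)
```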
Poetry, on the other hand, does not let you choose which lamb to sacrifice. Everyone in the thread over the last two years who has reported some success is misunderstanding the state of their install and has run into flaws in all three of the areas I'm describing. What they have will not correctly install anything that itself depends on PyTorch, which makes it useless, since everything in the PyTorch ecosystem depends on it. And the main workaround the community uses - installing torch first, then installing dependencies from a requirements.txt, then copying in a dump of scripts - is not compatible with Poetry.
2/3rds of Python end users do not engage with packaging at all, and pyproject.toml with dependencies covers only about 1.5% of the ecosystem. It provides only downsides compared to setup.py with dependencies pinned by commit, i.e. doing what golang does, which requires no external tools in Python (see the sketch below). In my opinion, the Poetry developers need to fix the pytorch situation or they will not get adoption during Peak Python.
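For reference, a commit pin in setup.py is just a direct git reference; the repo name and hash below are hypothetical placeholders:

```python
# setup.py - pinning a dependency to an exact git commit, golang-style.
# Repo name and commit hash are hypothetical placeholders.
from setuptools import setup

setup(
    name="example-project",
    version="0.1.0",
    install_requires=[
        "somepkg @ git+https://github.com/example/somepkg"
        "@7fd1a60b01f91b314f59955a4e4d4e80d8edf11d",
    ],
)
```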
A quick check puts a Ryzen 5 5600 and a Radeon RX 6600 at $360 combined, whereas the 8700G is priced at $330. And an RX 6600 will deliver around double the graphics performance. So even without factoring in motherboard and memory, the 8700G is hard to justify for a cheap gaming rig.
Ha! Back when I was in the shiny round disc business, I took a phone call in which someone said that, based on the prices we were quoting, we'd be out of business in a year because Apple had released DVD Studio Pro. Instead, we gained even more business from people who tried to have their projects done in DVDSP but ran into roadblocks that made the job impossible. They'd then come back to us and receive an even higher quote, since now there was even less time before their deadlines. C'est la vie.
You say out of business. I say, mediocrity all the way down from here.
Yes! The obvious answer is to just increase the number of position embeddings and train for that. This requires a ton of memory, however (attention scales with the square of the context length), so most are currently training at 4k/8k and then finetuning at higher lengths, similar to many of the image models.
However, there's been some work to "get extra mileage" out of the current models, so to speak, with rotary position embeddings and a few other tricks. These, in combination with finetuning, are the method many are using at the moment, IIRC.
The bottleneck is quickly going to be inference. Since the current transformer models need memory proportional to the square of the context length, the requirements go up very quickly. IIRC a 4090 can _barely_ fit a 4-bit 30B model in memory with a 4096-token context length.
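To make the quadratic term concrete, here's a back-of-the-envelope sketch (naive attention with fp16 scores; illustrative, not a full memory model):

```python
# Size of a single fp16 attention score matrix (seq_len x seq_len).
def score_matrix_mib(seq_len, bytes_per_elem=2):
    return seq_len**2 * bytes_per_elem / 2**20

for n in (2048, 4096, 8192, 16384):
    print(f"{n:>6} tokens: {score_matrix_mib(n):6.0f} MiB per head per layer")
# 2048 -> 8 MiB, 16384 -> 512 MiB: doubling the context quadruples the
# cost, and it multiplies across every head of every layer, on top of
# the model weights and KV cache.
```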
From my understanding some form of RNNs are likely to be the next step for longer context. See RWKV as an example of a decent RNN https://arxiv.org/abs/2305.13048
Yeah, that's a fairly well-studied one. Most of these techniques are rather "lossy" compared to actually extending the context window. The most likely "real solution" is going to be a combination of such tricks plus finetuning at higher context lengths to genuinely extend the window.
Not for pure Python code, but there are massive advantages for mixed C(++) and Python: I can now have multiple subinterpreters running concurrently and accessing the same shared state in a thread-safe C++ library.
Previously this required rewriting the whole C++ library to either support pickling (multiplying the total memory consumption by the number of cores) or allocate everything in shared memory (which makes normal C++ types like `std::string` unusable; you need to switch to e.g. boost::interprocess).
Now it is sufficient to pickle a pointer to a C++ object as an integer, and it'll still be a valid pointer in the other subinterpreter.
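A pure-Python sketch of the idea, using ctypes as a stand-in for the C++ object (in real use the integer would be pickled into another subinterpreter in the same process, and the C++ library must keep the object alive and thread-safe):

```python
# Demonstrates that a raw address survives pickling as a plain int and
# can be re-wrapped on the other side. With a real C++ library, the
# wrap/unwrap would go through the extension module instead of ctypes.
import ctypes
import pickle

buf = ctypes.create_string_buffer(b"shared C state")  # stand-in C object
addr = ctypes.addressof(buf)   # raw pointer as a plain Python int

payload = pickle.dumps(addr)   # ints pickle trivially; the object need not

# "Other subinterpreter": same process, same address space.
addr2 = pickle.loads(payload)
print(ctypes.string_at(addr2))  # b'shared C state'
```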
The harms of kitchen knives, fire, electricity, thinking, and getting out of bed all clearly exceed zero. This suggests to me that it's fundamentally wrongheaded to treat harm as the primary metric by which we evaluate things.
Certainly! In this case, they were interested in files that were too permissive.
I don't have a good example of the command, but it was basically looking for "world" permissions (the "other" bits) that were too open. It's important to note that the users/groups could be discarded/ignored.
They were using 'find ... -exec ls -ld {} \;', which does an LDAP lookup on each result to resolve UIDs and GIDs to names.
They could have made the process far more efficient with either the native '-ls' action built into find, or by adding '-n' to the exec'd 'ls' (i.e. 'ls -ldn').
Either would skip resolving names against the domain. Past a certain number of results/files, that expense becomes too high and the job times out.
- 20x20x4 MERV12 filter. The 4-inch pleating is key here, to reduce the air resistance on the box fan.
- 1-inch dust pre-filter. This is coarse, with low air resistance, and is there to extend the life of the more expensive MERV12/HEPA filter (so it doesn't get clogged with easy-to-filter dust).
- Both filters are on the intake side of the box fan. This means you don't need a bungee cord: the intake has negative pressure, so the filters just "stick". It also means your box fan sees only cleaned air.
The older I get, the less I identify with my current state and the more I identify with the person who transitions through states. This change in perspective has reduced my anxieties and anger significantly. "This too shall pass" and all that. The more my self-image is focused on superficial things, the more I take things personally. What we are angry about tends to be a reflection of ourselves more than of the current state of affairs.
If I see myself as a busy professional I might be much more aggravated by someone at the grocery store holding up the checkout line with EBT (since I am busy they must be lazy!). If I see myself as a social climber I will always be worrying if people are using me for something (since I am using them!). If I identify with my wealth I might develop some neurosis regarding the sight of the homeless (since they represent ultimate failure!).
I don't believe in reincarnation but it is a helpful thought experiment to think about what benefits and drawbacks your particular incarnation of life holds and how those might be different if you were incarnated elsewhere.
> Why can’t I just click a button and follow all the local events for my favorite artist/venue/team/theater/charity? Why do I still have to manually create events and copy and paste details? Why are there so few “add to calendar” and why do they often fail?
It's because of advertising.
Artists, venues, teams, theaters, charities and other performers may themselves not care - they make money on you being there. But everyone else between you and the event makes money on your attention. They explicitly don't want you to streamline or automate event discovery and attendance scheduling. They want you to visit their pages and be exposed to ads (whether regular ads where they get paid per click, or ads for other events, where they make money when you decide to attend).
The "attention economy" is, by definition, built on making everything inefficient and full of hassle. To monetize attention, you have to make the users pay it first.
This may be a tired point, but I feel it bears repeating. The problem isn't one of technology - the tech for what you want exists, and actually worked much better in the past. The problem is businesses: they actively don't want you to use the Internet in this way.