Congratulations! This looks really great. What've you found to be the best hands / end effectors these days? When do you think we'll have good, reliable 5-finger hands that are ~reasonably priced?
I'm not convinced that 5-finger hands are necessary right now, but there is a really long tail of hand suppliers that we've been exploring to help get the price down.
I think at volume the price for a good set of hands should settle somewhere around $300-500. Most of it comes down to meeting suppliers where they're at and negotiating mutually beneficial deals. It's not magic but it does require having a good understanding of the hardware in order to negotiate well.
Actually yeah, the benefit of our parallel gripper is that we get some proprioceptive feedback, which we can't get from the current 5-finger hand. I'm not sure how important this will be long term; I think vision can eventually mostly compensate if the ML models are good enough.
Off-the-shelf robots -- we've got our models running on a dozen+ different robot types (and have this specific generalization demo working on multiple platforms too).
I'll bite. I _worked_ at Stripe. Stripe has no authority in the decision making here; the issuing bank (i.e. the customer's bank) decides who wins and loses chargebacks. Stripe is a conduit of information, not a party to the decision.
I saw your foundation model is trained on data from several different robots. Is the plan to eventually train a foundation model that can control any robot zero-shot? That is, the effect of actuations on video/sensor input is collected and understood in-context, and actuations are corrected to yield the intended behavior. All in-context. Is this feasible?
More specifically, has your model already exhibited this type of capability, in principle?
Nearly 2 years ago I bet a roboticist $10 that we’d have “sci-fi” robots in 2 years.
Now, we didn’t set good criteria for the bet (it was late at night). However, my personal criteria for “sci-fi” are twofold:
1. Robots that are able to make peanut butter sandwiches without explicit training
2. Robots able to walk on sand (e.g. Tatooine)
Based on your current understanding, who won the bet? Also, what kind of physical benchmarks do you associate with “sci-fi robots”?
Hi! Very cool results. Are you able to share some numbers about the slope of the scaling curve you found, i.e. how performance responds to a growing number of demonstrations?
Academically I'd also be very interested in how much of a data-efficiency improvement you achieved with the pretrained model + task-specific post-training versus from-scratch task-specific training. For example, if post-training requires say 50 additional demos, and training a smaller model from scratch requires say 250 demos (or whatever) to match performance, that would be an interesting quantification of the efficiency benefit of using the big foundation model.
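To make that concrete, here's the trivial arithmetic I have in mind (the 50/250 figures are just the made-up numbers above, not anything you've reported):

    # Purely illustrative: 50 and 250 are hypothetical demo counts, not reported results.
    post_training_demos = 50   # demos needed with the pretrained model + post-training
    from_scratch_demos = 250   # demos needed to match performance training from scratch

    efficiency_multiplier = from_scratch_demos / post_training_demos
    print(f"data-efficiency multiplier: {efficiency_multiplier:.1f}x")  # -> 5.0x

Even a rough multiplier like that, averaged over a few tasks, would be a really interesting data point.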
How does the post-training step work? In the case of t-shirt folding, does a supervisor perform the folding first, many times? Or is the learning interactive, where a supervisor corrects the robot if it does something wrong?
(i'm an investor in the company, and invested over 3 years ago.)
this product has always been about AI—what they launched is almost exactly what they pitched me. their expectation of where the world was going ended up being prescient.
Call them tomorrow and ask them to fire whoever caused the AI to hallucinate on their tech demo and give a wholly incorrect answer to the question of "where is the next eclipse". Get them to fire whoever didn't check that video for accuracy while you're at it.
this is probably the most privacy-forward hardware device on the market—you have to physically be making contact with the device for it to begin listening (at which point an LED is prominently visible) and it will stop listening as soon as you break contact.
Echo devices, for example, were sold as having a "hardware mute switch" from day one. Sure enough, teardown after teardown[0] has confirmed the hardware mute switch actually physically disables the mic (cuts power to the ADC, mic lines, etc).
If this is implemented in software it's no different than a phone and worse than an Echo.
Physically make contact as in, tap it with your hand TNG-style? Or worse, hold contact with your hand? How do you project the laser display and talk without using both hands? Is the hand criss-crossing difficult in this situation?
An LED comes on... Is it bright? Can you see it in a direct line of sight from your eye to your shirt without fussing with it? Is the LED just there for others to know the owner isn't recording the conversation?
Could a firmware update let the camera passively watch? Is the physical-contact requirement for audio capture enforced in hardware or in software?
Honestly, I don't think the concern is this particular product or company, even if they can truly adhere to a privacy-first policy. For me the consideration is a slow erosion of privacy from any company or product. For instance: twenty years ago it wasn't possible to quickly take a discreet, high-quality video with something in your pocket. Smartphones made that possible, then we saw things like Google Glass, and now these accessory pendant devices will make it even easier. To be clear, I'm not against things like the Humane and Rewind pendants; I'm just curious about how they will impact society, especially considering how quickly we're moving without putting much thought into those impacts.
A lot of people are saying that using this device would require speaking all kinds of private things out loud in public, but people would likely alter their behavior and use of the device in public. The nature of the questions they ask would be different; they would self-censor. In private they'd use it differently. People don't watch porn on their phone in the subway (mostly), and they wouldn't state their credit card info out loud using this on a subway either. If you have to say "take a photo" then the people around you know you are taking a photo. If it can record video it should beep occasionally or something. I still don't have a complete idea of how the UI works, though. Can the projector project onto a wall instead of your hand? Can you listen to replies via wireless earbuds? I'd like to see something more in-depth about how to use it and what can be accomplished with it.
- he's not touching it during the phone call
- it's not super clear in the demo when he says "your engagement comes through your voice, touch, gesture, or the laser display"
How do you engage through (a) voice or (b) gesture then?
Even if that's true, if it sees any success it will both normalize that type of device in public and very shortly see AliExpress flooded with a bunch of cheap clones from companies with no such beliefs.
tbf, that precedent went out the window a long time ago, when most people got powerful computers with sophisticated voice-recording capabilities in their pockets and even on their wrists...
That might be nice, but it only covers the information-capture window. What about the bigger problem of all your data being beamed to OAI servers?
Privacy-forward hardware for the sensors, sure.
But I am assuming all input to the model can be used for training, just as is currently standard for any AI assistant?
So my conversations, my calendar, everything will be open to one entity; otherwise it would not be able to condense them into a summary. I don't have privacy anymore if the entity reads everything, even if my friends do not consent to their texts being fed into the AI as input.
This is a privacy issue for me, regardless of whether I allow it to activate the camera or microphone. Nothing stops a hacker with access from querying the entity to spit out all users whose texting history has a depressed sentiment analysis, etc.
I believe they're looking to imply here that they own their own charter, rather than renting someone else's, which is how almost all U.S. fintech companies operate (look in the website footer of, say, Unit and you'll see: "Banking services are provided by Unit's partner banks who are Member FDIC.")
this is an important point. There are other providers who offer programmatic creation of bank accounts, payments, etc. But all existing solutions wrap a bank, which in turn wraps middleware providers and core systems. When you work with Column, you're working with only Column. This has implications for cost (fewer people taking a slice of the pie), performance/usability/experience (modern, tightly integrated systems), and development velocity (fewer players in the game of Telephone). Column collapses the layers of the financial services stack and exposes this functionality via API.
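To put the stack-collapsing point in very rough terms (the names below are generic placeholders, not real vendors and not Column's actual architecture), the difference is just how many parties sit between the developer and the ledger:

    # Hypothetical sketch only: placeholder party names, not real vendors or
    # Column's actual architecture; it just counts intermediaries in each model.
    wrapped_stack = [
        "BaaS API provider",            # the layer the developer integrates with
        "program manager / middleware", # compliance and ledger glue
        "core banking system",          # system of record
        "partner bank",                 # holds the actual charter
    ]
    direct_stack = [
        "chartered bank with its own API",  # charter, ledger, and API in one place
    ]

    print(f"wrapped model: {len(wrapped_stack)} parties taking a slice of the pie")
    print(f"direct model:  {len(direct_stack)} party")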
I’m sure cost is part of the equation, but I’d imagine the control is far more important. Reselling legacy bank services means you’re limited to what they can do, which is usually not much. Most finance technology is heavily limited in what it can do because of its partners, and that’s why such products are usually just nicer interfaces to the same old services. Hence banks like Monzo in the UK building their own infrastructure from the ground up too. The less you’re dependent on legacy technology, the more you can do.
Stripe Reader may be a more accurate link: https://stripe.com/terminal/stripe-reader. This is Stripe announcing its own hardware that developers can use to build their own point-of-sale payments experiences.