> eventually your edge hardware is going to be able to infer a lot faster than the 50ms+ per call to the cloud.

This is interesting. Is that based on any upcoming technology improvement already in the works?



GP is likely referring to network latency here. There's a tradeoff between smaller GPUs etc. at home, which you can use with no network latency, and beefier hardware in the cloud, which carries a minimum round-trip latency on every call.


Sure, but if the model takes multiple seconds to execute, then even 100 milliseconds of network latency is more or less irrelevant.
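
To put rough numbers on it (all figures below are illustrative assumptions, nothing measured), a quick sketch in Python:

    # Back-of-envelope: when does network latency matter?
    # All numbers are illustrative assumptions, not measurements.
    network_rtt_s = 0.100   # assumed cloud round-trip time
    cloud_infer_s = 3.0     # assumed cloud inference time
    local_infer_s = 4.5     # assumed (slower) local inference time

    cloud_total = network_rtt_s + cloud_infer_s  # 3.10 s
    local_total = local_infer_s                  # 4.50 s
    # The 100 ms RTT is ~3% of the cloud total; inference time dominates.
    print(f"cloud total: {cloud_total:.2f} s, local total: {local_total:.2f} s")

At multi-second inference times the RTT is noise; it only starts to matter once local and cloud inference speeds converge.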


Comms is also the greatest battery drain for a remote edge system. Local inference can allow for longer operation, or operation with no network infra.
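
Rough energy math, with assumed figures for the radio and SoC draw (real numbers vary a lot by hardware and network conditions):

    # Energy per inference on a battery-powered edge device.
    # All figures are assumptions for illustration, not measured values.
    radio_power_w = 2.0   # assumed cellular radio draw while active
    radio_time_s  = 1.5   # assumed time the radio stays up per request
    local_power_w = 4.0   # assumed SoC draw during local inference
    local_time_s  = 0.5   # assumed local inference time

    remote_j = radio_power_w * radio_time_s  # 3.0 J per networked call
    local_j  = local_power_w * local_time_s  # 2.0 J per local call
    print(f"remote: {remote_j:.1f} J, local: {local_j:.1f} J")

Under assumptions like these, keeping the radio asleep can beat offloading even when local compute draws more instantaneous power.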



