I can't believe this... Phi-3.5-mini (3.8B) running in-br... | I can't believe this... Phi-3.5-mini (3.8B) running in-br...
I can't believe this... Phi-3.5-mini (3.8B) running in-browser at ~90 tokens/second on WebGPU w/ Transformers.js and ONNX Runtime Web! ๐Ÿคฏ Since everything runs 100% locally, no messages are sent to a server โ€” a huge win for privacy!
- ๐Ÿค— Demo:
webml-community/phi-3.5-webgpu

- ๐Ÿง‘โ€๐Ÿ’ป Source code: https://github.com/huggingface/transformers.js-examples/tree/main/phi-3.5-webgpu transformers.js-examples/phi-3.5-webgpu at main ยท huggingface/transformers.js-examples