The Phi series never fails to surprise me; combined with ONNX Runtime, it's really portable and powerful. I'm using Phi-3.5 instruct at the moment for enterprise clients and it's performing very well. Looking forward to bringing the vision model into the mix too. Fantastic work MSR team, keep up the amazing work! Small, Smart and Scalable for the win! 🚀
@quickpert1382 2 months ago
A realistic voice decoder alongside that image encoder is all we still need. Hope the Meta guys are not going to be late to the small vision models party.
@WearyTimeTraveler 2 months ago
The Phi models are truly impressive; excited to see the future work around embodiment. My only hope for the future is that frozen weights from different training stages are made available to download.
@GNARGNARHEAD 2 months ago
Open source, let's go!
@sammcj2000 2 months ago
Microsoft hasn't contributed in the most widely used format (GGUF) though, meaning that unless the community does the work it won't be usable in common tooling such as llama.cpp, Ollama, etc.
@ChristianNode 2 months ago
What do you mean, @sammcj2000?
@ahmedtremo 2 months ago
Great and concise explanation, thanks!
@n8works 2 months ago
This was a detailed and interesting video. Congrats on the achievement.
@renereiche 2 months ago
Phi-3 is absolutely incredible: super capable, yet resilient to misuse and always kind and understanding. Magical at this size already, and then it's even good at math. However, I think Microsoft should choose the parameter counts of the different versions more smartly with regard to current device hardware.
@markmatzke 2 months ago
Fantastic presentation! I'm particularly interested in how the Phi-3 Vision model's performance compares to other vision-language models in terms of scalability across different hardware platforms. It seems like a game-changer for integrating vision capabilities with language understanding. Also, how do you see the model evolving to address emerging challenges in diverse data contexts? Looking forward to seeing its future applications and updates!
@tamineabderrahmane248 2 months ago
Phi-3 Vision has the same structure as PaliGemma, and both are open source, great!
@p4r7h-v 2 months ago
brilliant
@ChristophBackhaus 2 months ago
So how well does this work for extraction from PDFs in comparison to OCR?
@r.m8146 2 months ago
awesome
@YiKidane 2 months ago
specswriter AI fixes this. Highly capable small vision model.
@sammcj2000 2 months ago
Needs a GGUF!
@octaviusp 2 months ago
How can I join the Microsoft Research team? That's one of my life goals, and I will reach it.
@fahnub 2 months ago
microsoft catchin up
@getasmilefix 2 months ago
LFG
@edi.maulana 2 months ago
Okay, great, but I have to turn on subtitles now.
@bilalazhar4495 2 months ago
The fucking contrast of the text transparency looks like straight garbage. Microsoft needs to fire all the modern art majors on their design team in the next layoff round.