Optimizing AI Applications for Mobile Devices with Siddhika Nevrekar - 697

  Рет қаралды 6,185

The TWIML AI Podcast with Sam Charrington

The TWIML AI Podcast with Sam Charrington

Күн бұрын

Пікірлер: 4
@johnkintree763
@johnkintree763 5 ай бұрын
The smaller the model, the quicker the first token, the more tokens/sec, and the less electricity used. According to Andrew Ng, using an LLM in an agentic workflow can improve the performance more than increasing the size of the LLM. Also, retrieving a previous response that has been fact checked can be faster than generating a new hallucinated response.
@johnkintree763
@johnkintree763 5 ай бұрын
Qualcomm AI hub should measure watt hours of electricity used in a typical hour of operation, in addition to the time to first token of output, and tokens per second of output for each optimized model.
@johnkintree763
@johnkintree763 5 ай бұрын
Once an application works at all, it can typically be optimized to run faster, and to run on less powerful hardware such as smartphones. Developers of hardware, such as Qualcomm, should provide the means of optimizing software for their hardware.
@johnkintree763
@johnkintree763 5 ай бұрын
The next thing I would like to see is a digital agent running on my smartphone that is part of an open source and decentralized global platform for collective human and digital intelligence.
Supercharging Developer Productivity with ChatGPT and Claude with Simon Willison - 701
1:13:46
The TWIML AI Podcast with Sam Charrington
Рет қаралды 7 М.
AI Agents: Substance or Snake Oil with Arvind Narayanan - 704
53:53
The TWIML AI Podcast with Sam Charrington
Рет қаралды 4,9 М.
Cheerleader Transformation That Left Everyone Speechless! #shorts
00:27
Fabiosa Best Lifehacks
Рет қаралды 16 МЛН
Enceinte et en Bazard: Les Chroniques du Nettoyage ! 🚽✨
00:21
Two More French
Рет қаралды 42 МЛН
To Brawl AND BEYOND!
00:51
Brawl Stars
Рет қаралды 17 МЛН
The Age of Industrial Intelligence with Nakul Duggal
1:07:04
Qualcomm
Рет қаралды 1,9 М.
AI Is Making You An Illiterate Programmer
27:22
ThePrimeTime
Рет қаралды 291 М.
Automated Design of Agentic Systems with Shengran Hu - 700
59:01
The TWIML AI Podcast with Sam Charrington
Рет қаралды 1,4 М.
AI Engineering Pitfalls with Chip Huyen - 715
57:08
The TWIML AI Podcast with Sam Charrington
Рет қаралды 1,6 М.
Speculative Decoding and Efficient LLM Inference with Chris Lott - 717
1:16:02
The TWIML AI Podcast with Sam Charrington
Рет қаралды 247
About 50% Of Jobs Will Be Displaced By AI Within 3 Years
26:26
Fortune Magazine
Рет қаралды 413 М.
AI Agents for Data Analysis with Shreya Shankar - 703
47:55
The TWIML AI Podcast with Sam Charrington
Рет қаралды 3,6 М.
Top Minds in AI Explain What’s Coming After GPT-4o | EP #130
25:30
Peter H. Diamandis
Рет қаралды 989 М.
Cheerleader Transformation That Left Everyone Speechless! #shorts
00:27
Fabiosa Best Lifehacks
Рет қаралды 16 МЛН