EDGE AI POD
Discover the cutting-edge world of energy-efficient machine learning, edge AI, hardware accelerators, software algorithms, and real-world use cases with this podcast feed from all things in the world's largest EDGE AI community.
These are shows like EDGE AI Talks, EDGE AI Blueprints as well as EDGE AI FOUNDATION event talks on a range of research, product and business topics.
Join us to stay informed and inspired!
EDGE AI POD
Cows Chewed Our Sensors And Still Taught Us About Edge AI
Use Left/Right to seek, Home/End to jump to start or end. Hold shift to jump forward or backward.
A failed 5G rollout in a legendary forest forced us to rethink everything we knew about AI infrastructure. Instead of pushing data to distant servers, we turned wearables, sensors, and tiny controllers into a cooperative network that can sense, decide, and act without the cloud. The result is a hands-on tour of decentralized AI: how to split models across devices, why feature fusion matters more than raw horsepower, and what it takes to make ad hoc networks reliable in the wild.
We walk through practical patterns for collaboration at the edge, from complementary sensing in search-and-rescue to pooled compute in crowded venues. You’ll hear how we orchestrate parallel processing on microcontrollers, assign inference to one core and radio handling to another, and compress features to keep bandwidth low. We also dig into continual learning and federated averaging, outlining strategies to adapt models locally while protecting privacy and avoiding catastrophic forgetting. Along the way, we share early results from agriculture and public safety pilots, plus the gritty realities of hardware constraints, scarce datasets, and the challenge of testing at scale.
If you’re curious about TinyML, edge AI, and how generative models might run collaboratively across many small devices, this conversation lays out a practical path forward. You’ll come away with a clearer picture of when decentralization beats centralized cloud systems, which protocols survive in noisy environments, and why the future of AI may look less like a monolith and more like a swarm. Subscribe, share this episode with a builder who loves constraints, and leave a review to tell us where you’d deploy a swarm of tiny models next.
Learn more about the EDGE AI FOUNDATION - edgeaifoundation.org
Why Decentralized AI Matters
SPEAKER_00So I work in two universities in Tiny ML, HAI, so I'm very passionate about code. Sorry. So I'm very passionate about the subject and I'm very active in a way to bring Tiny ML HAI to the local communities, to end users, especially in UK. The reason why I wanted to present this talk, it was last minute confirmation of my presentation. But the reason why I wanted to do this talk because I feel this is a very important topic. It could be potentially the future of HAI or even AI, decentralized AI moving away from the cloud and servers or centralized infrastructure.
5G Limits In Sherwood Forest
SPEAKER_00Over the years, I've been working in pervasive computing area, and the motivation behind this area is to bring intelligence and AI, smart sensors to everyday environment from retail to agriculture to parks and home environment. It happens that we had a project funded by the government which aimed to utilize 5G network in a big park. It's actually a forest, Robin Hood Forest, Sherwood Forest in north of Nottingham. But what happened is that the whole project was about utilizing 5G and showing innovation with 5G. And we had a few telecom companies. But what happened is that many of these companies couldn't set up a proper 5G to cover the whole park because the park was very, very big and the forest as
Rethinking Edge And Collaboration
SPEAKER_00well. So then we had, and um my team were aiming to embed little devices that can communicate and motivate people to move and be physically active and interact. We use some gamification approaches and so on. So we had to find another way to communicate between these devices using you know 5G wasn't uh viable. We wanted even to you know find uh uh um uh internet access to download our apps, our uh services, and so on, but it wasn't possible. So that's why at that stage I started to think more and more about edge AI, on how can we utilize decentralized uh approaches to be able to process things away from centralized infrastructure. Um but also beside that, one device wouldn't be enough. If you have a large number of people coming to a park, you need, you know, to utilize all these devices. You need these devices to communicate. Like, for example, if you want to allow people to play a game and you want a scoreboard, you need some kind of a system where you can exchange information between these devices. So you need this kind of collaborative system that can process and exchange. So you need wireless communication and all of these tools. And that's why we started to think more with my team about the uh decentralized AI approach. And there are many names for it. So you'll see during my presentation I call it distributed, I call it federated, I call it uh decentralized because it's still a very much a new area in its emphasis, and we're still trying to establish it. And I realized that um I need many, many more days to add all the components that can that contribute to the system. So you might find some slides slightly missing here. So
Pooling Tiny Devices As One Brain
SPEAKER_00okay. Uh this is just like a little illustration showing you like the vision in the future. Obviously, you will not see HAI written on sheep in the future, but but at the moment, if you go to a farm north of Nottingham, you will see our callers with HAI placed on cows who are not behaving at all. They have been chewing on our devices all day long and on the solar uh panels that we've um printed. But that but that's the vision that HAI should be everywhere, but without you seeing the devices. They should be embedded. I don't know if you know the concept of disappearing computing. So computing is there, but you don't see it, it empowers people. And I I don't need really to tell you how many devices are in the world. And I asked the question yesterday about like how many of these devices are utilized around the world? Um we still don't have, at least in the UK, we don't still don't have a working wearable system, for example, for healthcare. We don't have even a working wearable system for anything other than you know just a personal device that can be used for you know um general lifestyle. So there are many devices. Hardware, of course, can be if can evolve, but it there's there are a lot of chips uh out there, but we're not utilizing them. And these are potential resources that can be utilized. Oops. How do I go back? Okay. So what happens if we bring all of these devices and try to utilize their capabilities? Let's say, for example, we in this room we have, let's say, 20 wearable devices, whatever. Each device has one megabyte. Um together, let's say we have about two uh 20 megabytes of processing power. Exclude about two megabytes for the communication, then you have 18 megabytes, which is much, much better for running a model than one megabyte or half megabyte. So that's the idea. What if we bring all of these devices together as a computational resource for resource sharing, but not just the memory, not just the processing power, but also other things. You know, this device, for example, might sense something, the other device might might sense something else. So they can complement each other's. And that's why, again, we I mean collaborative learning has been there for a long time. Distributed learning has been there for a long
Search And Rescue As A Testbed
SPEAKER_00time, like for example, using um desktop devices to run something. Or federated learning is there as well. But the idea here is that if we have these edge devices doing their own thing, but then contributing to the overall model or learning, um then we have a bigger resource. It's still not as big as data center, but it's still powerful enough. And imagine if you have thousand, or like for example, in the case of uh football match where you know you have so many um fans with a lot of wearables, then you really have a considerable resource that can help at least in managing the event or the or um coordinating chants or you know songs and things like that. Um so yeah, so what happens if we distribute these tasks across all devices or multiple devices, um, multiple sensors, and then compress these features and make them float between these uh devices, and then you have all of these resources operating as one, reducing um memory per node, and then um you have a um a model, not necessarily bigger model. So you have, I mean, I'll explain later that how you have um a local model and uh global model. So again, uh just to put more emphasis on uh collaborative learning, because you could do distributed learning, each device can do its own thing, uh, and this device is identical to the other ones, or each device can do something different. So take the case, for example, in the case of search uh uh and rescue scenario, if you have like a number of people trying to find a missing person, right? So you could um ask um 20 people to go uh in the field and find um um a lost child or a person. They all could have a camera, an audio exchanging information, they all could be looking for a specific object or face recognition. But what if each device can look at um the scene from different um angles? And they work like uh to build a jigsaw so each one can find, like for example, this part of the body might be occluded where the other part is visible to the other person. So collaborative learning could be complementary, they could complement each other's, and then one angle is uh uh like the the body is um visible from one angle and then uh it might not be visible from the other side. So that's the idea. How can we do that in a way that each device can do something that is complementary to the other devices? But then also we could do parallel processing not just on devices, we could do parallel processing on the device itself.
Parallel Processing On Microcontrollers
SPEAKER_00So, for example, if you are using uh dual-core processor um with TPU, let's say, for example, or MPU or extra uh processing units, you could uh distribute the model on all of these devices, on all of these processors, right? So, for example, on ARM processor, you could use um the M4, M7, or in the case of the Google Coral, you could use the TPU. And we've done that in multiple projects. And this could save time. So you don't want some of these uh processors to be idle while you are processing. You want to utilize each resource as much as possible because you really need these extra few bytes or these extra uh processing powers. And and there are many types uh for parallel processing. I mean, you could, for example, do uh data parallel processing where each uh processor, for example, processing the same data, or um you could uh do a modular kind of processing where like each each processor is taking part of the model and processing, and then um then uh all the features uh uh and the out uh output of the uh initial model are fused to uh give uh an overall model. Um this meant to be for later. And then, of course, I mean we can't talk about this without talking about generative AI,
Generative AI At The Edge
SPEAKER_00right? Because um everybody wants to do generative AI. But again, you know, like generative AI is very much um interesting, but also power-hungry, and it needs uh you know um much much more uh processing capabilities. But again, here we have a better chance at utilizing wearable and distributed devices to run generative AI. I have many students at the moment working on this. I mean, this is still work in progress, but you could uh imagine that you uh distribute, like for example, you have a local domain knowledge, so this device is collecting different types of um information than this one, so you have different domains interacting with each other's. You might have different prompts. So just imagine, like, for example, here with all your devices collaborating and giving me feedback, or like writing a story together, right? So everybody will be using different prompts, but collectively we will build a story together. And I think that's that's very interesting for collaborative uh interaction, and that could be generative. Um I think I have a better slide for this. So the output will be localized summaries, actionable insights, and natural commands. But of course, I mean generative AI is one way around it, but you could do any other kind of models, you could do multi uh uh different modalities, you could use uh sensing, you you could do vision, and all sorts of things. Doesn't need to be generative. And there are many, many ways to do um decentralization.
Heterogeneous Swarms And Networking
SPEAKER_00Like, for example, in some of the projects we have heterogeneous systems where we have different edge devices that are either wearable, they are drones, there are robotics, or they could be the same. Like, for example, when I mentioned the case of wearable in here, everybody has, for example, a photo or has a wearable device. Uh so it could be the same type of device or multiple types of devices, they could be identical. Each device can do something different or identical. I mean, uh, and also doesn't need to be big. It could be one-to-one, for example, collaborative system, like for example, two people um walking around the park, having a conversation, trying to explore a topic. Um, but there's an important element to this that normally the HAI community doesn't think too much about it, which is communications. Because if you want to do this, you really need to think about communications, wireless communication in particular. So we're saying that we're not going to rely on the infrastructure, we're not going to connect to the cloud. So, how do we communicate between these devices? So we need some uh form of communication, wireless in particular, short range or mid-range,
Opportunistic Wireless And Protocols
SPEAKER_00but it has to be low power and has to be uh robust enough to enable that communication. And uh it it it also has to be available in a way that uh let's say uh uh robust, like uh I don't know if you know the concept of opportunistic networking, where you know uh not come to the network and disappear every now and then, or ad hoc networks uh or delay tolerant networks. These are old concepts in communications, but they are very important for uh these types of scenarios. Then when you start thinking about communication, you need to think about where to process, where to handle the communication, because you could run the models on the device itself, but then where do you handle communications? Still, you need the the processor, the same processor, for example. So some of the the things that we try to do, like to run a model on one processor and handle the communication on another processor, again you are using some kind of parallel processing, but for different um tasks. And and beside the the physical wireless communication, we need to think about the communication protocols, what kind of protocol that we should use to enable this type of interaction. On top of that, uh you have we we we think about continual learning, adaptive
Continual Learning And Global Models
SPEAKER_00learning. And and this is an important part of distributed systems. Uh it's not like, for example, for other cases of wearable, you could use about personalization, you could choose to do uh continual learning. But in the case of distributed learning, the decentralized AI, we have to do it. And the reason for that is that uh we have to um manage or overcome uh catastrophic forgetting. Uh as a concept where uh, for example, the the model has learned something, and then uh you update the model with new data and it might forget the previous uh learning. So we need to make sure that the model can be updated but without forgetting the previous learning. And in our case here, if you have a lot of nodes, um maybe I could I could show how this looks like just to explain the concept. Yeah, so so you will have a local model to do the local data processing, but then you will have a global model. The global model is really floating uh across all the devices. And here, um in the global model where we do the continual learning, where we do we update the model, whether it's like um updating or retraining multiple
Case Studies And Early Results
SPEAKER_00layers or the five fine-tuning the final layer. Um, and that's why continual learning is very important in this case. I will go back to where I was very quickly. So um because of the time limitation, I'm not going to be able to explain everything, but I will give you some examples of some of the things that we've been doing in my uh lab. Um if I go, can I go back?
unknownYeah, okay.
SPEAKER_00So, in terms of parallel processing, we've been using regular um uh microcontrollers to test if we can, like, for example, do multitasking. Um, and we we have a paper, and we could read about our work in terms of uh parallel feature fusion um accepted recently for publication. Um the idea here is that you have one processor um running um image uh model um CNN and another running audio CNN, and then you fuse everything and process the fusion on the M7, for example. So this is quite interesting because you could really save some time and reduce latency. Um but one of the key issues that we have faced, and that's why it's very important for us to work with um hardware designers, is that we've noticed that there's no shared memory between the processors. So we had to find um um uh a way around it, you know, like we had to hack it uh using um uh procedure call, back call to be able to um communicate the parameters between the uh different processors. But this is something very important. If we want to do parallel processing, we need this type of shared bit memory, we need the hardware design to take into account that people will be able to process multiple things at the same time and maybe even have having more processors, maybe four processors or more on the regular
Testing At Scale And Next Steps
SPEAKER_00um microcontrollers. And in this in this particular paper, we saved around 48 milliseconds by just uh doing this. So I will rush a little bit here. Um we we applied similar things uh for for uh agriculture. We have multi uh tasks on microcontrollers um in the farm, our devices under in the field, we have LoRaWAN, uh we use uh ST has very good um um uh microcontroller with LoRaWAN interface. So sometimes our choice of microcontrollers, not the baby or processor, it's the um wireless interface or the other sensors that are available on the device. And another big problem, I mean, we talked about this, the data sets issue, where to find the data sets for the students to work on. I mean, this is this is completely a different problem, and we struggle with this all the time. And especially with distributed systems, like where do you get the data? How do you test this system uh and and where do you test it? Um also we we we did some experiments to see you know the running time on different processors, and then going to the concept of distributed system very briefly. This is the typical uh pipeline for HAI. Um and then I explained this briefly before. Um again, we have a paper on the archive in this area to explain the concept, but uh more or less you have um model that tries to converge on multiple devices, and in this case, we assume we have hundreds of devices, many clusters, intra-clusters, interclusters, and we average uh the so we use uh federated average uh to uh inform uh the different clusters. Um and the initial results showed very, very promising uh progress, but obviously it depends on the data set. I'm just rushing here, and we applied this in in many, many um similar scenarios, like what I mentioned before in the park.
Events And Community Invitations
SPEAKER_00This is the park I mentioned earlier. Uh, we've been working with the Met Police because this is very important to be able, like for example, to find um uh someone in high street, you know, by by uh the the policing services, or for example, for um uh traffics or uh vehicular uh networks. Um again we have another uh strand of work. And let's see the last. Sorry, this is yes, just just to uh um finish this with to say this is quite interesting, um, but it's it requires a lot of work. That's why we try to uh tackle many, many issues, whether it's the continual learning, whether it's the data sets, whether it's the um, you know, like where where do you run this model and um how do you uh exchange the information, how do you deal with the gossip uh protocol or do you use different protocol for the communication? But our biggest challenge, and I I mentioned this uh to many of you, is the test bit. How do you test this? How do you evaluate it? How do you um run the model? Because you need hundreds of devices to be able to do this. So sometimes we do simulation, but sometimes we use a small number of devices to run this. So this is quite difficult, so it's it requires more attention from the community. Very briefly before I finish, um we have some upcoming events, uh, the retail event in July, we have um wearable, AI unwearable, uh digital health in UK. Um I'm sharing this event, and then we have HAI Japan, Tokyo in October. If you are interested, please let me know. And finally, we have a new uh funded network uh called TinyML UK, uh where we will have a network manager to complement what HAI Foundation is doing to bring the uh HAI community in UK to collaborate with the wider community.
unknownBut uh that's all. Thank you.