 Hello, everybody. First of all, thank you very much for coming to this talk. We are going to present a dashboard. We have been creating the last three months, trying to show the current status of the IoT projects in the open source. So, in this slide, you can see the URL. So, if you want, you can just now go to the URL and start playing with the dashboard. We are going to present now. Also, these slides are in the talk page in the first website. So, you can also download the slides you want now. This is the version 0.1 because our idea is to present you this first version of the dashboard, but from now, try to do it in a community way. Trying to incorporate the views of the IoT community. So, this dashboard is a kind of tool for everybody in the community. So, this is our main goal. So, I am Alberto Castillo from... I'm working right now in things of those, developing another IoT platform. But mainly, I'm an open source advocate, and also I'm developing a separate open source project. Now, I'm working with the PLC4X guys that they are creating a framework for communicating with the industrial protocols in order to offer a common API for communicating with them. And Valerio is presenting this with me. Okay, I'm a Grimoire Lab maintainer and cause contributor, since Grimoire Lab is part of Chaos. And I work for Bitcoin that is maintaining the platform that we used to build the dashboard. Anyway, the platform is open source, and you can use it if you want. Okay, my interest is our source code analysis, ETL processes, and reverse engineering. And I know basically nothing about IoT, but I was just analyzing the data. Okay, so this is an outline of the presentation. We are going to present you some of IoT architectures because we have based our analysis in these architectures trying to find a common categories that we are going to use in the dashboard and the presentation to compare the status of IoT in different categories. And we have followed some criteria to select this open source project because you can imagine that selecting between the hundreds of open source projects that exist in IoT is something that is difficult. So we have decided to focus in the projects that are already in the open source foundation, the main open source foundation, the passive foundation, a keystone foundation, a linear foundation. And after that, we are going to show you some findings and insights and some conclusions about our work. So why are we doing that? What is our main motivation for creating that digital dashboard? Probably the main motivation is, on one hand, understanding the IoT community. I have joined this community around six months ago and I want to know the main projects, the different architectures and to get this knowledge about this world and also to offer visibility of the main projects in open source. Both of us are believers in open source and we want to collaborate as much as possible in this field. And how are we doing that? In Grimoire Lab, we worked together in the past and the focus of the platform is to doing analytics about software development communities. And we are using these tools to extract this information from the several communities in IoT. And as I said at the start, we want contribution by the IoT community in order to have this as a shared tool in the community. Some warnings about our work. The first one is that initially we are pretty focused in the industrial IoT sector. But after some thoughts about it, IoT and industrial IoT are not so different. So in some way they are merging the technologies. So it is not so important. About the project selection and classification is something pretty divided. So for example, when we start showing you the categories and how we have classified the projects in this category, you can say, okay, this is a project that I know about it and you have classified it in a category that is not the best category. So it's something that we can discuss about it and we can modify the report and create future versions improving it with your knowledge. And we are using pretty basic metrics, baseline metrics because we prefer having this as a simple report, simple dashboard in order to send clear messages and if they are interested and we have value in it, we can go deeper. We've created like chaos from Linux Foundation that we have here a photo for past Friday that they have a meeting here in Brussels and we can cooperate with these communities in order to have more advanced metrics. And about the data, we use Creative Combined Licenses for all the data we are producing that the data that we are creating. But you are going to see data from GitHub and this data at the end is from the developers that provided this information. Okay, about the selection of the projects, I have talked about it a bit but at the end we decided to sell a project from the main open source foundations. Probably you can say that with Apache Eclipse and Linux Foundation, also the Mozilla Foundation is working in Web of Things but they are working in a pretty specific part of IOD. These foundations are working in several categories with different projects so we think that they are... that this is a better solution now but this is something that we can change. Okay, and about the comparison, we have analyzed the IOD, we have analyzed different architectures and we have at the end decided to use some categories to classify the projects in order to compare them. So we are going to present them a bit later. So the first step, once we have decided to use the project from the foundation is to define how we are going to classify these projects in different categories. And for doing that we have put the focus in the IOD platforms because they are really complex platforms that cover a lot of different things, a lot of different systems and we have based our analysis in them. So I'm going to spend more time here. These are the basic features in this kind of platforms and the first one we have used is the one that is the reference architecture from the Industrial Internet Consortium that is a consortium that is defining different standards in the IOD fields and they have a strong knowledge and position in the IOD so we find that this is an interesting one. And you often hear that there are three tiers, the S tier, the platform and the enterprise tier. This kind of architecture is the same architecture that we are going to find in the architecture provided, proposed by the different foundations. I'm going to enter in detail because we are going to lose all the time if I do that but if we compare this high level vision the one from Eclipse Foundation that they also propose to have three tiers, the constrained devices, the gateways and smart devices and at the end of the cloud platform you can mob it against the previous one and if you take a look to the proposal from the Lean Foundation the architecture is proposed by them, they are proposed in architecture and also the different projects they are also in some way classifying them in these architectures. The three layers, the three tiers in the same way. So it seems that they are common patterns between all the architectures that we can use to classify our projects in a common way for all of them that this is the goal we have for the presentation. I'm talking about the Apache Software Foundation they don't try to create this kind of approach a strategic approach trying to classify architectures and IOD and so on. They just offer a very attractive framework a very attractive platform to work inside Apache and they have in the big data clearly they have attracted the best open source projects about big data and in IOD they are also attracting some pretty interesting projects but mainly focusing in data. So, project classification. After doing an analysis of all of that we have concluded that the best categories for IOD are these three categories that we have here. Probably in the next release of the report we are going to simplify that and for example the four categories of ecosystem that are pretty generic categories we are going to merge in another one but the idea is to have one category for always mainly real time operating systems and visualization in the edge. Also about protocols, the different protocols used to communicate in this IOD field about the gateways. They are always using the gateways that are a critical part of the platform. Processing of the data. Doing the persistence of the data in the edge. Framework and platforms that are that cover most of the different systems and at the end the application framework is designed to deploy applications that are going to use the IOD information the IOD data and process it for doing added value things and the four kind of ecosystems that are the two last ones. Okay, so now Valid is going to present you the Grimoire Lab. So as Alvaro said the idea of this work was to understand the IOD ecosystem and after defining the category the idea was to have a dashboard so some way to visualize these categories and how to understand this ecosystem. So we use Grimoire Lab. Grimoire Lab as I mentioned before is a platform free open source and is used to make for software development analytics. So in this case we can see that we have this is an overview of Grimoire Lab we have different components. One component is in charge of retrieving data from popular data sources used in software development then we have another component that takes care of digesting this information that is stored in ElastiSearch or JSON based database and then once we have the information here we can visualize it so we can generate reports or in this case the dashboard. So for this specific work we focus basically on two data sources that are Git and GitHub. So we selected the projects we use Perceval to retrieve the data Perceval is a tool that simplifies the life of people who want to extract data from software development tools for instance from Git we extracted commits and from GitHub we extracted poor request and issues. The other two components are GRAL wasn't used for this work but is used to extract information about source code so lines of code, comments, cyclomatic complexity and then RTUR is a tool that is able to execute GRAL and Perceval at scale. So once we collected the data from Git and GitHub we used the data storage component that is Grimoire Elk. Grimoire Elk is in charge of reaching the data so basically we, in this case we are adding information that are not directly available in the data we are collecting so for instance if we focus on Git we could add here information about paid programming. If we focus on GitHub we could add information about the number of comments that is something that is not directly available in the GitHub API. One important part here is the way we deal with identities so in order to understand the community, the IoT community we started collecting the information, email information that were available in Git and GitHub so according to the email, the domain we started classifying and grouping together so then we were able to identify people from Red Hat or people from IBM so large companies that are actively involved in IoT. And then we have so these things were done with Sortiata and Astal and then here we reached a point where we have like just the data and we need just to use the last component to visualize the data. So here that is an extension of Kibana that is part of the elastic search ecosystem and the output that we are going to see right now is basically a dynamic webpage where you can filter and drill down the data to better understand what you have. So this is basically the dashboard so on top you can see that we have different tabs so in this case since we don't have too much time we will focus just on Git but you can access the dashboard and start checking the data. We removed the outer information so you will not see here the name of people, just the name of the organizations so as you can see if we start checking what we found we can see that Linux Apache and Eclipse are basically there they share the IoT ecosystem then here you can see the categories Alvaro mentioned so we can see that for instance virtualization is much more important than fog or integration that seems to be site categories of the IoT ecosystem and then here we have the organizations so the top one is then we have Apache the third one is unknown because we were not able to classify the people there so we were not able to associate their domain to a company. Generally there you can find people from university or freelancer so in that case it's clear that they are unknown and then here you can see on the past 5 years we have around 2000 commits and well 5000 people contributing to IoT so we are talking about an important community and here we can see the evolution of commits and authors grouped by the different organizations and again we see the foundation sorry and here you can see that maybe at the very beginning Apache was the first one but then Eclipse started also contributing to IoT so ok so if we want to change the complete history for the last 10 years is how you can change the the timeline that you are analyzing. You can see that the data is automatically reflected here and now we are analyzing the last 10 years while before we analyzed the last 5 years so and we have prepared some ad hoc visualizations that are this one because the one that you see are basically the fourth ones in GreenWard Lab and here this is a summary of the findings we got so we can see for instance in the category application we can see that the 3 foundation are there so maybe Apache has more commits but Eclipse has a larger community while Linux has more organization contributing to their project and again we can go for instance virtualization and we can see that in this case Linux is leading this part then we have the other categories that are for instance the fourth one there is basically Eclipse there and then we have persistence where there is just Apache so here we have tried to summarize a bit the findings however in the presentation then we have like several slides for the people that want to read the presentation so we have tried to put for each category something we found anyway again if you want to navigate through the dashboard it would be great so then you can like try to answer the question you would like to get answered so I guess I pass you ok so in order to have enough time to complete the slides we are going to show you yes for example the findings we have found for one category we have 10 categories you can take the slides and go for all the categories in order to understand the different things that we have found from the data but for example in the case of OSM virtualization we have found interesting because there is a large community in this field and Linux foundation for example is the one that seems to have more activity to show you the decisions that we have done building this information for example we have considered that the third field project is a project from the Linux foundation that probably is not totally right because it is not a complete Linux foundation project but it is related it's a kind of collaborative project from the Linux foundation so the relationship between the projects and the foundations are pretty clear in some cases but in other ones it is not so clear for example in this category in the case of Apache software foundation they have not a lot of activity here now but the new x-project this morning we have talked about it is joining the Apache foundation so for example in the next release of this report we have seen that for example the Apache started having a lot of commits coming from these new x-projects because it's huge it has a lot of different commits so at the end we try to show the main projects and also the activity the main projects and the organization behind if of the categories we have done this for all the categories so please take a look to the slides if you are interested and you find strange things that is possible because we don't know the details about everything we are more than happy to recite this feedback from you and to start improving this report so these are for example all of the different categories you can use the dashboard and just click in the category and all the dashboard is filtered out for this category and you can analyze the data for it and we have here created a table it's a kind of summary table showing the different categories the main foundation in this category taking a look to the number of commits the size of the community and so on and then the main project in this category for example we have here Apache Cami, Fefi, Pajo, Kura projects that we have for example to talk about then this morning and at the end the main companies that are there so as a conclusion which foundation is the winner from this study that we have done and there are good news that for us for Valerio and me the winner is open source because both of them have a pretty close activity in a global view and taking a look to the different categories you can find that the different foundation are working in most of all of the categories and some are leading one, the other one is leading the other one easy foundation has different strategies so this is pretty good because for example if some of the foundations that are going to try other different strategies so at the end we think that there are diversity that is pretty good for IOD so about the cover-ups of the dashboard because you can think you have selected a project from the foundation but maybe the foundation have not attracted the main IP projects the data with a report from the pack rudder I don't know if you know it but it is a consultancy that is creating reports about different industrial sectors and they have reports about the IP platform based on open source here in Europe and the main companies according to this report are both in Eurodeck that are also the main companies two of the main companies appear in the in the dashboard we feel that it is not the reality and maybe it is not 90% accurate but it is pretty accurate and it is a good estimation of the current status of the IOD community so maybe we are missing the Mozilla foundation with the webfins IOD but probably we will add it in future reports so one things that we have found that is important is the collaboration between the different projects in the different foundations for example the projects in Apache are collaborating with the projects in Apache or also with the project in Eclipse and in Swindish and we don't have a lot of data but for example in the project that I know better the PSE4X they are working closely with Eclipse with Milo Condito and with other projects and for example the Eclipse Cura that is in the IOD gateway that is the leading project in this field is heavily based so it seems that there is a clear collaboration between the projects but we need to as always to improve it but in Open Source it is pretty natural to collaborate between projects but it is not a bad idea to try to improve this collaboration for example with this kind of rooms that is great for the IOD community and with this that we are trying also to offer visibility so you can know the projects that are in the IOD they try to close them together and this is our last slide about the takeaways from this talk the first one is the dashboard that is going to be public for some time we don't know about but Viteria has offered to have it online and to support it so we can come with that for the IOD community and what is important is the contribution that we expect your contribution we have a Jihad repository in which we are opening issues in order for example to improve the classification improve the selection improve the foundation that we are considering improve the insights the different interesting things about the projects so we hope that you can help us in this repository so please if you are interested in improving this kind of information join the projects and we are more than happy than here all your your ideas and for us that's all it's time for question I don't know any questions please raise your hands this is a relative positioning of open source providers for IOT solutions did you also look at close source providers for IOT solutions like Microsoft Amazon to close close source providers yeah close source providers we have put all the focus in open source projects and open source projects inside the foundation so for example Microsoft is doing pretty interesting things in IOT and there are project open source also but they are not inside the foundation this is why they are not here but probably this is something that we need to change and to start because they are a player in this community they are an important player so we should find ways probably in the next release of this platform we try to integrate other projects that are not part of the project but we need to find a clear criteria to do that so we need to think about ways why we are adding for example the Microsoft projects in GitHub and not for example the projects from any other providers so these kind of things we have time for one more question so anyone ok thank you guys ok thank you