
Tag: geoanalytics



3 February 2022, Media Release
Media release, 3 February 2022
In a quest to expand its smart city initiatives, Montpellier Méditerranée Métropole has chosen a consortium that includes Gisaïa among 10 others consortia in a smart city initiative for digital technology at the service of the territory’s project. It is a one year experimental work for the metropolis to offer new services to the city.
The competitive call for projects received over 30 innovation applications covering diverse themes: mobility, culture, energy, the environment, living in the city, digital inclusion and IOT.
Gisaïa, ICIA Technologies, and Patrick Gendre, who is a mobility expert, proposed the “Open Mobility Dashboards” project.
This project responds to the Montpellier Méditerranée Métropole mobility plans for 2030. The Métropole is looking to develop mobility indicators that will support data-backed decision making to improve travel for its citizens. arlas.city, our public transportation data analytics solution will support the initiative in developing public transport indicators. The Open Mobility Dashboard will also include mobility indicators for pedestrians and cyclists.
The projects were selected based on the the following criteria
In France, Montpellier, joins Tisseo collectivities in using arlas.city for public transportation decision making.
Arlas.city supports public transport operators, authorities and researchers in ensuring that their decisions are founded on data. Various business departments in the Montpellier Méditerranée Métropole will provide their expertise to improve feedback and knowledge of the application contexts of the services developed by the winners.
22 Octobre 2021, Media Release
Media release, 22 Octobre 2021
Our team joins others from the French space sectors to form a consortium that has been chosen by the Directorate General of Enterprises (DGE) and Centre National d’Études Spatiales for the DOMINO-X project.
We are proud and happy to contribute to this innovative initiative that will exploit Cloud and Artificial Intelligence technologies to standardize the architectures of ground-based Earth Observation segments and promote the emergence of a modular product and service offering. This will facilitate a need to offer customers lower costs for their requests.
Airbus Defence and Space are leading the DOMINO-X consortium that brings together other experts from: Airbus, Thales Alenia Space, Safran, CS Group, Orange and Capgemini, Gisaïa, Stack Labs, Human design Group, Geotrend and Leanspace.
Gisaïa will be fronting our solution, ARLAS for geospatial data analytics . This is not the first time that Gisaïa joins some of the consortium members and offers solutions that simplify the access to Earth Observation products. ARLAS is part of the framework that Theia and Dinamis are built on.
ARLAS is open source architecture software that is also built on the latest big data technologies making it both easily interoperable with other solutions. It scales easily with a growing archive . This is important in the DOMINO-X mission which seeks to explore volumes of Earth Observation data, from diverse product families. But that is not all, ARLAS will also be offering interactive visualisation of the data with a simplified filtering process that makes it easy to quickly sort through data and find what one is looking for.
The DOMINO-X ultimate quest is to deliver valuable insights for decision making from Earth Observation data by making it accessible to as many users as possible.
April 09, 2020
To better understand animal behaviour, many researchers are equipping some individuals with GPS beacons and sensors to track their movements. This new data are enabling major advances in the understanding of our biodiversity but also of climate change. To make it possible to analyse data that are voluminous, it is necessary to use high-performance tools.
At Gisaïa, we are sensitive to environmental concerns and are developing ARLAS Exploration, an open-source software for geo-spatial data mining capable of responding to Big Data challenges. We have therefore decided to put ARLAS to the test in order to make it possible to explore animal data interactively.
Data from a study on a storks population was used, and we will see how ARLAS can be used to discover the behaviour of these migratory birds. We will also see how it can accompany the implementation of Machine Learning algorithms to automatically detect some of these behaviours.
The Max Planck Institute of Animal Behavior, a German research institute that studies wildlife, recently launched Movebank, a platform that aims to centralise data from animal studies conducted by researchers around the world, to encourage collaboration and make it possible to cross-reference data from different studies. This initiative also aims to make this data freely available to the general public.
In a study by Cheng & al. (2019) [1], researchers equipped a population of 169 storks with GPS beacons and collected data between 2013 and 2019. These data were retrieved from the Movebank [2] platform as CSV files.
The white stork (Ciconia ciconia) is a large species of wading bird in the Ciconiidae family. Its plumage is mainly white, with black on the wings. This species has been the subject of protection and reintroduction programmes and is mainly found in Eastern and Western Europe. The stork is highly migratory and winters in Africa, making its movements particularly fascinating to study.
The data were first processed in order to group and link together the successive observations of the same bird and to calculate the travelled distances. This type of processing is done using ARLAS PROC/ML, our massive distributed processing platform. The data thus formatted is then integrated into ARLAS Exploration and our storks are then ready to be explored.
ARLAS Exploration is a map-centric application that allows to appreciate the spatial dispersion of the data. A bar of graphs on the left of the application also allows to visualise and filter the other dimensions of the data, so we can observe the distributions of different quantities such as the travelled distance, but also the altitude and the speed. At the bottom, the timeline allows to see the temporal distribution of the measurements made.
It is clear that the traced storks move within a perimeter that extends from southern Germany to West Africa. There is a peak in the number of observations in August 2014. In addition, data are actually available for 81 birds for a total of about 7 million positions.
The various graphs allow you to filter the data on the dimensions represented and the whole application is instantly updated with each new selection. It is also possible to navigate the map on certain areas and filter according to geographical selections drawn with a tool on the right side of the application.
Depending on the amount of data to be displayed in the application window, ARLAS switches from an “aggregate” mode, density maps ideal for general visualisation of the flows, to a “features” mode, the detail of the actual data to observe the actual paths of the storks. We can thus isolate interesting behaviours, and see that some storks seem to use hot airstreams to gain altitude, for example:
The actual route displayed can be colored in different ways, depending on the speed or the bird ID for example:
These two representations allow us to understand that the four selected storks named Hans, Schwitza, Kiki et Julia, move together at the same pace over this period, while still being able to clearly distinguish the four storks.
Thus, ARLAS Exploration is a tool particularly adapted to interact intuitively and interactively with bird positioning data, even when the volume of data becomes large. This makes it a strong partner for researchers.
The observed storks tend to migrate great distances to change habitat locations. One can then distinguish two attitudes: staying in the same area (local) or travelling to change areas (travel). Both of these behaviours are locally visible to the naked eye in ARLAS Exploration, but automatic detection of these activities could make it possible to study the migrations of all birds on a large scale in an extremely efficient manner. This is why we have chosen to use Machine Learning algorithms to automate this identification. Supervised learning was carried out to train the classification model.
This process was therefore carried out in several stages:
A supervised classification model needs training data to learn how to recognise targeted behaviours. In our case, it is necessary to annotate our data. Each fraction of a trip must be identified as ‘travel’ or ‘local’.
ARLAS Exploration is a tool particularly adapted to the creation of training sets since it allows to assign a label to the current data selection. It is therefore possible to manually identify the parts of trajectories corresponding to a large displacement (“travel”) or a local activity (“local”) and label them as such.
In practice, 4 birds have been labelled as such:
For each bird, a period of approximately a year was used to capture at least one round trip in the migration. We therefore have a total of 334,372 fragments (interval between two measurements) that will be usable for training the model.
The quality of a Machine Learning algorithm depends above all on the quality of its training data. Once a sufficient number of representative fragments of the behaviors to be detected have been labelled, it is necessary to choose the sizes that will be given as input of the model. First of all, the tagged data can be retrieved using an ARLAS API, available in Python among others, which allows the training data to be downloaded.
In our case, the selected features illustrating the movement of these birds will be based on travelled distances and “as the crow flies” distances” calculated over different time windows. These features are not present in the raw data and can be calculated using ARLAS PROC/ML, our processing platform adapted to large volumes of data.
Once these new quantities have been calculated, they can be used to train the different chosen Machine Learning models.
Several classification models have been tested. In order to be able to compare the quality of these classifiers, the calculation of indicators is necessary. Since the classes are disproportionate (~6% travel in the training set), several metrics were used to correctly evaluate the quality of ‘travel’ detection. A “cross-validation” is performed to avoid overfitting by partitioning the training set and measuring classification performance on data that is not used during the training of the model.
The metrics used are based on the confusion matrix of the prediction:
We have:
As all experiments are performed under the same conditions, the models can be compared with each other, in particular thanks to the MLFlow tool used to record the results. Finally, after numerous experiments, an XGBoost classifier was chosen, both for its performance and its training speed.
Once the model has been chosen and trained, it can be applied to other birds and the results of this migration detection can be exported to ARLAS Exploration thanks to the tagging system (also available via API). It is then possible to visualise the results directly in the application. This allows a better understanding of our model by quickly identifying on which part of the data the predictions would fail. It is also possible to validate or correct the results, which makes it possible to increase the training set and to train the model again on more data. In the case of our storks, the model was applied to 26 birds, corresponding to 2,650,000 fragments.
We can also follow the track of a particular bird and identify the different stopping places along its route. For the stork named Zozu, for example, the following results are obtained:
We can also date the great movements of these storks. If we consider the predicted travel fragments of the stork named Zozu, for example, we can observe the different peaks on the timeline, which makes it possible to identify the periods of the year when the bird migrated:
If we look at one of these peaks in particular, we can date these migrations very precisely and see the different stages of the journey, so this journey between Switzerland and northern Spain between 21/08/2015 and 29/08/2015 took place in 7 stages:
The migration of these birds can also be explored on a larger scale. If one selects all the fragments in ‘travel’ for all the labelled birds, one can see a migration corridor following the Mediterranean coast towards Spain in the south of France:
Finally, it is possible to identify the different living places favoured by the storks during their journey around the Strait of Gibraltar by selecting the ‘local’ predicted fragments for all the storks:
The automation of migration detection has therefore greatly facilitated its analysis and ornithological experts can now focus on the variations in dates and destinations of the migrations undertaken by the storks. The possibility of cross-referencing this information with other data, such as meteorological data, can provide an even better understanding of the behaviour of these large migratory birds in relation to, for example, climate changes.
We have seen that ARLAS is particularly well suited for exploring position data of storks. The interactive map navigation provides valuable information on the behaviour of these migratory birds. But ARLAS can also be used to support the production of Machine Learning algorithms by facilitating the creation of training set and the visualisation of classification results. Finally, once the Machine Learning models have been trained, it is possible to apply them to large-scale data with ARLAS PROC/ML and see the results in ARLAS Exploration. All these results are available in a demonstration that is available at demo.
If we have focused here on migration, many other animal behaviour could be the subject of such studies. Obviously, ARLAS can be applied to all kind of geo-traced animals, but also to all geo-referenced data. Feel free to have a look at the other application examples on demo.arlas.io.
Thanks:
We would like to thank the director of the Max Planck Institute of Animal Behavior, Dr M. Wikelski ,and his team for providing this data.
References:
[1] Cheng Y, Fiedler W, Wikelski M, Flack A (2019) “Closer-to-home” strategy benefits juvenile survival in a long-distance migratory bird. Ecology and Evolution. doi:10.1002/ece3.5395
[2] Fiedler W, Flack A, Schäfle W, Keeves B, Quetting M, Eid B, Schmid H, Wikelski M (2019) Data from: Study “LifeTrack White Stork SW Germany” (2013-2019). Movebank Data Repository. doi:10.5441/001/1.ck04mn78