Recent decisions by major space agencies have been making unprecedented amounts of imagery available for research and operations. This brings about a unique opportunity to measure the global and local changes in our environment and assess the human impacts on land and the oceans. However, current scientific methods for extracting information for Earth observation data lag far behind our capacity to build sophisticated satellites. These satellites produce massive amounts of data, but only a fraction of that data is effectively used for scientific research and operational applications. Our work addresses a key scientific problem: How can we substantially improve the extraction and validation of land use and land cover change information from big Earth Observation data sets in an open and reproducible way?
In response to this challenge, we are building and deploying a new type of knowledge platform for organization, access, processing and analysis of big Earth observation data. The platform uses a scientific database based on the open source SciDB innovative array database management system, capable of managing large remote sensing data sets. We are developing an innovative set of spatiotemporal image analysis methods, mostly based in analysis of satellite image time series. These analytical methods use the open source R statistical language.
The results we have already obtained and will be presented at the conference include:
(a) A Web Time Series Service (WTSS) for serving time series extracted from remote sensing imagery on the internet.
(b) A Spatial Data Infrastructure (SDI) architecture for big EO data sets.
(c) A time-weighted version of the dynamic time warping method for land use and land cover classification using remote sensing image time series. This method has achieved 90% accuracy for land use change information in large areas in the Brazilian Amazonia.
(d) A new method for near-rela time change detection in tropical forests, based on Kalman filtering.
(e) A set of collaborative validation tools for land use classification using remote sensing time series.
Using array databases such as SciDB and analytical tools in the R statistical language, the resulting infrastructure is a powerful and open infrastructure for big Earth Observation data analysis.