Deutsch
 
Datenschutzhinweis Impressum
  DetailsucheBrowse

Datensatz

DATENSATZ AKTIONENEXPORT

Freigegeben

Konferenzbeitrag

Using supervised machine learning based on data quality indicators for automated data cleaning of GNSS position time series

Urheber*innen

Bamahry,  Fikri
IUGG 2023, General Assemblies, 1 General, International Union of Geodesy and Geophysics (IUGG), External Organizations;

Legrand,  Juliette
IUGG 2023, General Assemblies, 1 General, International Union of Geodesy and Geophysics (IUGG), External Organizations;

Pottiaux,  Eric
IUGG 2023, General Assemblies, 1 General, International Union of Geodesy and Geophysics (IUGG), External Organizations;

Bruyninx,  Carine
IUGG 2023, General Assemblies, 1 General, International Union of Geodesy and Geophysics (IUGG), External Organizations;

Fabian,  Andras
IUGG 2023, General Assemblies, 1 General, International Union of Geodesy and Geophysics (IUGG), External Organizations;

Externe Ressourcen
Es sind keine externen Ressourcen hinterlegt
Volltexte (frei zugänglich)
Es sind keine frei zugänglichen Volltexte in GFZpublic verfügbar
Ergänzendes Material (frei zugänglich)
Es sind keine frei zugänglichen Ergänzenden Materialien verfügbar
Zitation

Bamahry, F., Legrand, J., Pottiaux, E., Bruyninx, C., Fabian, A. (2023): Using supervised machine learning based on data quality indicators for automated data cleaning of GNSS position time series, XXVIII General Assembly of the International Union of Geodesy and Geophysics (IUGG) (Berlin 2023).
https://doi.org/10.57757/IUGG23-3291


Zitierlink: https://gfzpublic.gfz-potsdam.de/pubman/item/item_5019692
Zusammenfassung
Data cleaning is one of the most challenging steps in GNSS data analysis. Besides being time-consuming, failure in this process can lead to inaccurate and unreliable daily GNSS position and velocity estimates. To date, GNSS data cleaning was performed by finding positions that statistically differ from other positions without knowing the reason that is causing the outlier. However, we know that the degradation of data quality is one of the key factors influencing the quality of GNSS position estimates. We implemented a supervised machine-learning algorithm to predict, based on daily GPS data quality indicators of a GNSS station, if the daily GNSS data will be good enough to result in a reliable station position. To do so, we investigated the correlation between degraded GNSS data quality indicators and outliers in daily GNSS position time series. Six GNSS data quality indicators (number of observed versus expected observations in dual frequency, the lowest elevation cut-off observed, number of missing epochs, number of satellites, number of observations, and number of cycle slips) were employed to construct a predictive model that is able to detect outliers in daily position time series. The algorithm was trained using 12 years of position time series for 200+ GNSS stations and their associated daily data quality indicators. Based on this, we assessed the most relevant GNSS data quality indicators for detecting outliers in the GNSS position time series. We present the current development of this automated algorithm, the challenges we faced, and the preliminary results of this work.