Multitemporal Remote Sensing for Urban Mapping using KTH-SEG and KTH-Pavia Urban Extractor

University dissertation from Stockholm : KTH Royal Institute of Technology

Abstract: The objective of this licentiate thesis is to develop novel algorithms and improve existing methods for urban land cover mapping and urban extent extraction using multi-temporal remote sensing imagery. Past studies have demonstrated that synthetic aperture radar (SAR) have very good properties for the analysis of urban areas, the synergy of SAR and optical data is advantageous for various applications. The specific objectives of this research are:1. To develop a novel edge-aware region-growing and -merging algorithm, KTH-SEG, for effective segmentation of SAR and optical data for urban land cover mapping;2. To evaluate the synergistic effects of multi-temporal ENVISAT ASAR and HJ-1B multi-spectral data for urban land cover mapping;3. To improve the robustness of an existing method for urban extent extraction by adding effective pre- and post-processing.ENVISAT ASAR data and the Chinese HJ-1B multispectral , as well as TerraSAR-X data were used in this research. For objectives 1 and 2 two main study areas were chosen, Beijing and Shanghai, China. For both sites a number of multitemporal ENVISAT ASAR (30m C-band) scenes with varying image characteristics were selected during the vegetated season of 2009. For Shanghai TerraSAR-X strip-map images at 3m resolution X-band) were acquired for a similar period in 2010 to also evaluate high resolution X-band SAR for urban land cover mapping. Ten  major landcover classes were extracted including high density built-up, low density built-up, bare field, low vegetation, forest, golf course, grass, water, airport runway and major road.For Objective 3, eleven globally distributed study areas where chosen, Berlin, Beijing, Jakarta, Lagos, Lombardia (northern Italy), Mexico City, Mumbai, New York City, Rio de Janeiro, Stockholm and Sydney. For all cities ENVISAT ASAR imagery was acquired and for cities in or close to mountains even SRTM digital elevation data.The methodology of this thesis includes two major components, KTH-SEG and KTH-Pavia Urban Extractor. KTH-SEG is an edge aware region-growing and -merging algorithm that utilizes both the benefit of finding local high frequency changes as well as determining robustly homogeneous areas of a low frequency in local change. The post-segmentation classification is performed using support vector machines. KTH-SEG was evaluated using multitemporal, multi-angle, dual-polarization ASAR data and multispectral HJ-1B data as well as TerraSAR-X data. The KTH-Pavia urban extractor is a processing chain. It includes: Geometrical corrections, contrast enhancement, builtup area extraction using spatial stastistics and GLCM texture features, logical operator based fusion and DEM based mountain masking.For urban land cover classification using multitemporal ENVISAT ASAR data, the results showed that KTH-SEG achieved an overall accuracy of almost 80% (0.77 Kappa ) for the 10 urban land cover classes both Beijign and Shanghai, compared to eCognition results of 75% (0.71 Kappa) In particular the detection of small linear features with respect to the image resolution such as roads in 30m resolved data went well with 83% user accuracy from KTH-SEG versus 57% user accuracy using the segments derived from eCognition. The other urban classes which in particular in SAR imagery are characterized by a high degree of heterogeneity were classified superiorly by KTH-SEG. ECognition in general performed better on vegetation classes such as grass, low vegetation and forest which are usually more homogeneous.It is was also found that the combination of ASAR and HJ-1B optical data was beneficial, increasing the final classification accuracy by at least 10% compared to ASAR or HJ-1B data alone. The results also further confirmed that a higher diversity of SAR type images is more important for the urban classification outcome. However, this is not the case when classifying high resolution TerraSAR-X strip-map imagery. Here the different image characteristics of different look angles, and orbit orientation created more confusion mainly due to the different layover and foreshortening effects on larger buildings. The TerraSAR-X results showed also that accurate urban classification can be achieved using high resolution SAR data alone with almost 84% for  eight classes around the Shanghai international Airport (high and low density built-up were not separated as well as roads and runways).For urban extent extraction, the results demonstrated that built-up areas can be effectively extracted using a single ENVISAT ASAR image in 10 global cities reaching overall accuracies around 85%, compared to 75% of MODIS urban class and 73% GlobCover Urban class. Multitemporal ASAR can improve the urban extraction results by 5-10% in Beijing. Mountain masking applied in Mumbai and Rio de Janeiro increased the accuracy by 3-5%.The research performed in  this thesis has contributed to the remote sensing community by providing algorithms and methods for both extracting urban areas and identifying urban land cover in a more detailed fashion.