Automated Feature Extraction (AFE)

While automated computer vision has proven very successful in well-constrained industrial environments where illumination, object types, and orientations are known a priori, it is exponentially more difficult to analyze and extract features in high-resolution satellite and/or airborne earth imagery given the wide intra-class feature variations (spectral, spatial, context, etc.) found within individual scenes and spatio-temporal variations across many scenes. However, given the rapid growth in computational capacity and the corresponding development of more sophisticated image processing techniques, we believe that automated feature extraction (AFE) algorithms can be developed to achieve a level of proficiency required for robust upstream processing of multi-modal datasets.

Much of our AFE research has focused on urban-area feature extraction utilizing high resolution Commercial Satellite Imagery (CSI) like that shown below from Space Imaging's Ikonos satellite. Automated extraction of urban features and fusion of these features with other intelligence information can produce "smart" vector products that contain a wide variety of information for each feature. CSI data typically have four multi-spectral (MS) bands (R, G, B, NIR) and one panchromatic (PAN) band with spatial resolutions on the order of 2.4-4 m for the MS data and 0.6-1.0 m for the PAN data.

Advantages of CSI data are as follows:


Road Extraction

We have developed a number of fully automated road extraction algorithms and additional research in this area is ongoing. A linear feature extraction (LFE) algorithm is illustrated below where the directional spectral similarity is measured for each non-vegetated pixel (identified using an NDVI threshold). Large linear segments with a high length-to-width ratio are identified as candidate road segments. These are then combined in a piecewise linear fashion to form road networks even when the network topology contains curvilinear roads.

Another approach utilizes image segmentation followed by perceptual grouping rules to combine both segments and road centerlines derived from the segments. An illustration of this is shown below.

We then employ multi-detector fusion strategies to combine the output of the different road detectors in an optimal way. The approach dynamically and automatically uses the road detector output that is best suited for a particular road network topology (gridlike, curvilinear, etc.) based upon local image analysis. Examples of the integrated multi-detector road extraction results for an urban and suburban area are shown below.

Evaluations of the extracted road network using representative test sites show completeness values that range between 70-86% and correctness values that range between 70-92%. These are amongst the highest accuracies for fully automated road extraction reported in the literature thus far.

 

Building Extraction

We have developed a number of fully automated building extraction algorithms and additional research in this area is ongoing. Multi-detector strategies are needed for robust performance given the heterogeneous nature of the spectral signatures, structural morphology, and spatial topology of building features in high-resolution satellite imagery.

By applying mathematical morphology operators to high-resolution images, the multi-scale image structure can be decomposed. Morphological operators can separately identify objects/structures that are brighter or darker than the surrounding area. The images below show where compact objects, both bright and dark, have been identified in the urban image at one particular scale. Many of the bright building structures in the image are captured in the center image, and many of the dark L-shaped building shadows are identified in the right image. This type of decomposition can be performed at many scales to identify features/objects of varying shapes and sizes.

The multi-scale objects can then be analyzed based on shape, size, spectral content, spatial context, etc. to identify and extract 2D building polygons. For example, detection of building shadows can be used to identify the buildings causing the shadows as shown here:

An example of the multi-detector building extraction for a complex dense urban scene is shown below. Automated 2D building vector extraction accuracy for this urban scene was about 73%. This is the highest accuracy for fully automated building extraction reported in the literature thus far.