VectorDimensionalityReduction¶
Performs dimensionality reduction of the input vector data according to a model file.
Description¶
This application performs a vector data dimensionality reduction based on a model file produced by the TrainDimensionalityReduction application.
This application has several output images and supports “multi-writing”. Instead of computing and writing each image independently, the streamed image blocks are written in a synchronous way for each output. The output images will be computed strip by strip, using the available RAM to compute the strip size, and a user defined streaming mode can be specified using the streaming extended filenames (type, mode and value). Note that multi-writing can be disabled using the multi-write extended filename option: &multiwrite=false, in this case the output images will be written one by one. Note that multi-writing is not supported for MPI writers.
Parameters¶
Name of the input vector data -in vectorfile
Mandatory
The input vector data to reduce.
Statistics file -instat filename [dtype]
An XML file containing mean and standard deviation to center and reduce samples before dimensionality reduction (produced by ComputeImagesStatistics application).
Model file -model filename [dtype]
Mandatory
A model file (produced by the TrainDimensionalityReduction application,
Output vector data file containing the reduced vector -out filename [dtype]
Output vector data file storing sample values (OGR format). If not given, the input vector data file is used. In overwrite mode, the original features will be lost.
Input features to use for reduction -feat string1 string2...
List of field names in the input vector data used as features for reduction.
Output feature -featout [prefix|list]
Default value: prefix
Naming of output features
- Prefix
Use a name prefix - List
Use a list with all names
Prefix options¶
Feature name prefix -featout.prefix.name string
Default value: reduced_
Name prefix for output features. This prefix is followed by the numeric index of each output feature.
List options¶
Feature name list -featout.list.names string1 string2...
Mandatory
List of field names for the output features which result from the reduction.
Principal component dimension -pcadim int
This optional parameter can be set to reduce the number of eignevectors used in the PCA model file. This parameter can’t be used for other models
Writing mode -mode [overwrite|update]
Default value: overwrite
This parameter determines if the output file is overwritten or updated [overwrite/update]. If an output file name is given, the original file is copied before creating the new features.
- Overwrite
Overwrite mode - Update
Update mode
Examples¶
From the command-line:
otbcli_VectorDimensionalityReduction -in vectorData.shp -instat meanVar.xml -model model.txt -out vectorDataOut.shp -feat perimeter area width
From Python:
import otbApplication
app = otbApplication.Registry.CreateApplication("VectorDimensionalityReduction")
app.SetParameterString("in", "vectorData.shp")
app.SetParameterString("instat", "meanVar.xml")
app.SetParameterString("model", "model.txt")
app.SetParameterString("out", "vectorDataOut.shp")
app.SetParameterStringList("feat", "perimeter area width")
app.ExecuteAndWriteOutput()