Difference between revisions of "Point Clustering"
From OSGeo
(→References) 

Line 22:  Line 22:  
=== References ===  === References ===  
* [http://en.wikipedia.org/wiki/Data_clustering Wikipedia] Article on Data Clustering  * [http://en.wikipedia.org/wiki/Data_clustering Wikipedia] Article on Data Clustering  
−  * [http://  +  * [http://bonsai.ims.utokyo.ac.jp/~mdehoon/software/cluster/software.htm#pycluster PyCluster]: Python Cluster Functions (2013) 
−  * [http://trac.osgeo.org/postgis/ticket/174 Point Clustering Utility Trigger] enhancement idea reported as ticket to PostGIS Trac.  +  * [http://trac.osgeo.org/postgis/ticket/174 Point Clustering Utility Trigger] enhancement idea reported as ticket to PostGIS Trac (2012). 
−  * [http://  +  * [http://gis.stackexchange.com/questions/11567/spatialclusteringwithpostgis "Spatial Clustering with PostGIS] from gis.stackexchange.com (2011) 
−  +  * [http://www.geocomputation.org/2000/GC015/Gc015.htm Using Genetic Algorithms in Clustering Problems]: paper from [http://www.geocomputation.org/ GeoComputation] 2000 conference (2000)  
−  * [http://www.geocomputation.org/2000/GC015/Gc015.htm Using Genetic Algorithms in Clustering Problems]: paper from [http://www.geocomputation.org/ GeoComputation] 2000 conference  +  * [http://www.geocomputation.org/2000/GC024/Gc024.htm Automatic clustering via boundary extraction for mining massive pointdata sets]: paper from [http://www.geocomputation.org/ GeoComputation] 2000 conference (2000) 
−  * [http://www.geocomputation.org/2000/GC024/Gc024.htm Automatic clustering via boundary extraction for mining massive pointdata sets]: paper from [http://www.geocomputation.org/ GeoComputation] 2000 conference  +  * <s>[http://www.nabble.com/%27clustering%27ofpointst1261780.html#a3347923 PostGIS Mailing List] thread on clustering points</s> 
+  * <s>[http://www.nabble.com/clusteringpointst1404935.html#a3781371 Here] & [http://www.nabble.com/VisualizingPointDatat1052056.html#a2741608 here]: Mapserver Mailing List threads on clustering points</s> 
Latest revision as of 16:16, 12 October 2014
Contents
Point Clustering: Various Approaches
Please fill this in with any approaches that you have tried for Point Clustering along with code snippets. Please include discussion on why a particular method worked well or didn't work well and what circumstances it may be good for.
Possible Approaches
 Coordinate interleaving (i.e. 1. rounding input coordinates, 2. grouping/aggregating them, and then 3. averaging their original coordinates so that the cluster position is at the weighted coordinate of all input geometries).
 Kmeans Clustering
 Hierarchical Clustering
 Distance calculation for each coordinate pair
Input Parameters
Depending on algorithm...
Partitioning methods
 Map grid width ("quare / manhattan world", see coordinate interleaving/rounding)
 Some selfcorrelation threshold (see e.g. kmeans)
 Predefined irregular polygons (e.g. zip code boundaries)
Implementations
References
 Wikipedia Article on Data Clustering
 PyCluster: Python Cluster Functions (2013)
 Point Clustering Utility Trigger enhancement idea reported as ticket to PostGIS Trac (2012).
 "Spatial Clustering with PostGIS from gis.stackexchange.com (2011)
 Using Genetic Algorithms in Clustering Problems: paper from GeoComputation 2000 conference (2000)
 Automatic clustering via boundary extraction for mining massive pointdata sets: paper from GeoComputation 2000 conference (2000)

PostGIS Mailing List thread on clustering points 
Here & here: Mapserver Mailing List threads on clustering points