Clustering automatically groups points by numerical similarity, which can help you to quickly reveal distinct groups within your data. This can be useful for situations like identifying customer segments within sales data or categorizing IT tickets to identify common problems.
The underlying algorithm used in Virtualitics Explore Desktop is the k-means algorithm. Computed clusters can be saved as a new data feature, and for easy differentiation, are automatically mapped to the Color dimension in Mapping View.
Using Clustering
-
Click the Clustering icon ( ) in the top toolbar to open the Clustering panel.
-
Input the number of clusters you'd like to display. By default, Number of Clusters is set to Auto. If you would like to specify an exact number of clusters to find, enter a number between 2 and 16.
-
Select the features you would like to use to determine the clusters by clicking and dragging them from the Features panel into the Add Features section or using the Input All button.
-
(Optional) Remove unwanted features by hovering over the feature and clicking the Remove button ( ).
-
Click Run.
Next Steps
Once you've run Clustering, you'll see that computed clusters are automatically mapped to the Color dimension. A new data feature is also generated called (X) Cluster Result, where X corresponds to the number of times you have run Clustering. You can simply click and drag the newly-created feature to any dimension or to the Features Panel, allowing you use this feature in future mappings.
Based on your mapping settings in Mapping View, you can easily view the clusters on your plot. Additionally, you can use the Insights tool on your newly created Cluster Result data feature to see additional information about that data.
Previous Article |
Next Article |