Clustering automatically groups points by numerical similarity, which can help you to quickly reveal distinct groups within your data. This can be useful for situations like identifying customer segments within sales data or categorizing IT tickets to identify common problems.
The underlying algorithm used in Virtualitics Explore is the k-means algorithm. Computed clusters can be saved as a new data feature, and for easy differentiation, are automatically mapped to the Color dimension in Mapping View.
Using Clustering
-
Click the Clustering icon ( ) in the left-side toolbar to open the Clustering panel.
-
Input the number of clusters you'd like to display. By default, Number of Clusters is set to Auto. If you would like to specify an exact number of clusters to find, enter a number between 2 and 16.
-
Select the features you would like to use to determine the clusters by choosing them from the Select Features drop-down or using the Input All button.
-
(Optional) Remove unwanted features by hovering over the feature and clicking the Remove button ( ).
-
Click Run.
Next Steps
Once you've run Clustering, you'll see that computed clusters are automatically mapped to the Color dimension. A new data feature is also generated called (X) Cluster Result, where X corresponds to the number of times you have run Clustering. You can simply right click the newly-created feature and select Add to Features, which will add this feature to the Features and Filters Panel, allowing you use this feature in future mappings.
Based on your mapping settings in Mapping View, you can easily view the clusters on your plot. Additionally, you can use the Insights tool on your newly created Cluster Result data feature to see additional information about that data.
Previous Article |
Next Article |