Related Categories

Load from Disk

Before you can start visualizing your data, you need to load it into the application. Virtualitics supports loading data from tabular text files, CSV files, and Excel spreadsheets from your local machine. Your data needs to be in a machine readable format. Simply put, this means each row has values and the same number of columns.

To load a file, simply click the Local button on the Datasets tab that appears after launching the application and while in the “Spreadsheet View”, and then find the file to load.

Selecting a file will open the “Dataset Preview” window. This window lets you confirm that the data you are loading is what you intended and choose a subset of rows and columns to load. Once you’ve confirmed your settings, select the “Load Dataset” button to finish loading the data.

Supported File Types

  • CSV – Comma Separated Values
  • Text files – Note: Must specify the delimiter used to separate values in the Dataset Preview window
  • Excel sheets – Note: Virtualitics will only load in the first sheet in a workbook with multiple sheets
  • VIP Project – A saved VIP project with a “.vip” extension

Dataset Preview Options

  • Headers – Virtualitics automatically takes the first row of your dataset as names for that column. If your dataset does not have header names, set this to “No Headers” and the application will assign generic numbered names for each column.
  • Delimiter – Allows the user to specify the character that separates values in the data file (defaults to comma). The drop down menu contains a number of commonly used delimiters, as well as a custom option that you can use to enter any combination of characters.
  • Columns/Rows – Enter ranges here to specify which columns and rows to import from the data file.
    • Random subsample – If you have a large dataset and only want to examine a smaller, random subsample, that subsample can be created upon import by specifying either the percentage or number of rows to use.
  • Number Format – Change this for datasets with regional formatting that uses a comma for the decimal separator.