Difference between revisions of "Data Set File Formats"
From Rave Documentation
(→Working with Excel Files) |
|||
Line 2: | Line 2: | ||
=Working with Excel Files= | =Working with Excel Files= | ||
− | *If you load an Excel file that contains multiple worksheets, each (non-empty) worksheet will be loaded as a separate [[ | + | *If you load an Excel file that contains multiple worksheets, each (non-empty) worksheet will be loaded as a separate [[data set]]. The [[data sets]] will be named after the corresponding worksheets. |
*Excel files take longer to load that plain text files, so if you're working with large [[data sets]] you might want to use .txt files. | *Excel files take longer to load that plain text files, so if you're working with large [[data sets]] you might want to use .txt files. |
Revision as of 16:38, 22 August 2013
Rave can read .txt, .csv, .xls, and .xlsx files. The file must be formatted as a "flat file database," which simply means it has the variable names in the first row, and data records in all remaining rows. Each column must be separated by a comma or tab (or be separate columns in Excel), and each row of the file must have the same number of columns. If data is missing, it should be listed as "nan", not simply left blank.
Working with Excel Files
- If you load an Excel file that contains multiple worksheets, each (non-empty) worksheet will be loaded as a separate data set. The data sets will be named after the corresponding worksheets.
- Excel files take longer to load that plain text files, so if you're working with large data sets you might want to use .txt files.