Show data as an interactive table
Choose one of the datasets from the
Datasets dropdown. Files are loaded into Radiant through the Data > Manage tab.
There are several ways to select a subset of the data to view. The
Filter data box on the left (click the check-box) can be used with
< symbols. You can also combine subset commands, for example,
x > 3 & y == 2 would show only those rows for which the variable
x has values larger than 3 AND for which
y is equal to 2. Note that in R, and most other programming languages,
= is used to assign a value and
== to determine if values are equal to each other. In contrast,
!= is used to determine if two values are unequal. You can also use expressions that have an OR condition. For example, to select rows where
Salary is smaller than $100,000 OR larger than $20,000 use
Salary > 20000 | Salary < 100000.
| is the symbol for OR and
& is the symbol for AND
It is also possible to filter using dates. For example, to select rows with dates before June 1st, 2014 enter
date < "2014-6-1" into the filter box and press return.
You can also use string matching to select rows. For example, type
grepl('ood', cut) to select rows with
Very good cut. This search is case sensitive by default. For case insensitive search use
grepl("GOOD", cut, ignore.case = TRUE). Type your statement in the
Filter box and press return to see the result on screen or an error below the box if the expression is invalid.
It is important to note that these filters are persistent and will be applied to any analysis conducted through in Radiant. To deactivate a filter un-check the
Filter data check-box. To remove a filter simply delete it.
||less than or equal to||
||greater than or equal to||
||exactly equal to||
||not equal to||
||x OR y||
||x AND y||
||x is one of y||
Filters can also be used with R-code to quickly view a sample from the selected dataset. For example,
runif(n()) > .9 could be used to sample approximately 10% of the rows in the data and
1:n() < 101 would select only the first 100 rows in the data.
By default all columns in the data are shown. Click on any variable to focus on it alone. To select several variables use the SHIFT and ARROW keys on your keyboard. On a mac the CMD key can also be used to select multiple variables. The same effect is achieved on windows using the CTRL key. To select all variable use CTRL-A (or CMD-A on mac).
By default only 10 rows of data are shown at a time. You can change this setting through the
Show ... entries dropdown. Press the
Previous buttons at the bottom-right of the screen to page through the data.
Click on a column header in the table to sort the data. Clicking again will toggle between sorting in ascending and descending order. To sort on multiple columns at once press shift and then click on the 2nd, 3rd, etc. column to sort by.
For variables that have a limited number of different values (i.e., a factor) you can select the levels to keep from the column filter below the variable name. For example, to filter on rows with ideal cut click in the box below the
cut column header and select
Ideal from the dropdown menu shown. You can also type a string into these column filters and then press return. Note that matching is case-insensitive. In fact, typing
eal would produce the same result because the search will match any part of a string. Similarly, you can type a string to select rows based on character variables (e.g., street names).
For numeric variables the column filter boxes have some special features that make them almost as powerful as the
Filter data box. For numeric and integer variables you can use
... to indicate a range. For example, to select
price values between $500 and $2000 type
500 ... 2000 and press return. The range is inclusive of the values typed. Furthermore, if we want to filter on
0.32 ... will show only diamonds with carat values larger than or equal to 0.32. Numeric variables also have a slider that you can use to define the range of values to keep.
If you want to get really fancy you can use the search box on the top right to search across all columns in the data using regular expressions. For example, to find all rows that have an entry in any column ending with the number 72 type
72$ (i.e., the
$ sign is used to indicate the end of an entry). For all rows with entries that start with 60 use
^60 (i.e., the
^ is used to indicate the first character in an entry). Regular expressions are incredibly powerful for search but this is a big topic area. To learn more about regular expressions see this tutorial.
It is important to note that column sorting, column filters, and search are not persistent. To store these settings for use in other parts of Radiant press the
Store button. You can store the data and settings under a different dataset name by changing the value in the text input to the left of the
Store button. This feature can also be used to select a subset of variables to keep. Just select the ones you want to keep and press the
Store button. For more control over the variables you want to keep or remove and to specify their order in the dataset use the Data > Transform tab.
To download the data in csv format click the icon on the top right of your screen.
Click the report () icon on the bottom left of your screen or press
ALT-enter on your keyboard to add the filter and sort commands used by Radiant to a (reproducible) report in Report > Rmd.
For an overview of related R-functions used by Radiant to view, search, and filter data see Data > View