vignettes/pkgdown/view.Rmd
view.Rmd
Show data as an interactive table
Choose one of the datasets from the Datasets
dropdown. Files are loaded into Radiant through the Data > Manage tab.
There are several ways to select a subset of the data to view. The Filter data
box on the left (click the check-box) can be used with >
and <
symbols. You can also combine subset commands, for example, x > 3 & y == 2
would show only those rows for which the variable x
has values larger than 3 AND for which y
is equal to 2. Note that in R, and most other programming languages, =
is used to assign a value and ==
to determine if values are equal to each other. In contrast, !=
is used to determine if two values are unequal. You can also use expressions that have an OR condition. For example, to select rows where Salary
is smaller than $100,000 OR larger than $20,000 use Salary > 20000 | Salary < 100000
. |
is the symbol for OR and &
is the symbol for AND
It is also possible to filter using dates. For example, to select rows with dates before June 1st, 2014 enter date < "2014-6-1"
into the filter box and press return.
You can also use string matching to select rows. For example, type grepl('ood', cut)
to select rows with Good
or Very good
cut. This search is case sensitive by default. For case insensitive search use grepl("GOOD", cut, ignore.case = TRUE)
. Type your statement in the Filter
box and press return to see the result on screen or an error below the box if the expression is invalid.
It is important to note that these filters are persistent and will be applied to any analysis conducted through in Radiant. To deactivate a filter un-check the Filter data
check-box. To remove a filter simply delete it.
Operator | Description | Example |
---|---|---|
<
|
less than |
price < 5000
|
<=
|
less than or equal to |
carat <= 2
|
>
|
greater than |
price > 1000
|
>=
|
greater than or equal to |
carat >= 2
|
==
|
exactly equal to |
cut == 'Fair'
|
!=
|
not equal to |
cut != 'Fair'
|
|
|
x OR y |
price > 10000 | cut == 'Premium'
|
&
|
x AND y |
carat < 2 & cut == 'Fair'
|
%in%
|
x is one of y |
cut %in% c('Fair', 'Good')
|
is.na | is missing |
is.na(price)
|
Filters can also be used with R-code to quickly view a sample from the selected dataset. For example, runif(n()) > .9
could be used to sample approximately 10% of the rows in the data and 1:n() < 101
would select only the first 100 rows in the data.
By default all columns in the data are shown. Click on any variable to focus on it alone. To select several variables use the SHIFT and ARROW keys on your keyboard. On a mac the CMD key can also be used to select multiple variables. The same effect is achieved on windows using the CTRL key. To select all variable use CTRL-A (or CMD-A on mac).
By default only 10 rows of data are shown at a time. You can change this setting through the Show ... entries
dropdown. Press the Next
and Previous
buttons at the bottom-right of the screen to page through the data.
Click on a column header in the table to sort the data. Clicking again will toggle between sorting in ascending and descending order. To sort on multiple columns at once press shift and then click on the 2nd, 3rd, etc. column to sort by.
For variables that have a limited number of different values (i.e., a factor) you can select the levels to keep from the column filter below the variable name. For example, to filter on rows with ideal cut click in the box below the cut
column header and select Ideal
from the dropdown menu shown. You can also type a string into these column filters and then press return. Note that matching is case-insensitive. In fact, typing eal
would produce the same result because the search will match any part of a string. Similarly, you can type a string to select rows based on character variables (e.g., street names).
For numeric variables the column filter boxes have some special features that make them almost as powerful as the Filter data
box. For numeric and integer variables you can use ...
to indicate a range. For example, to select price
values between $500 and $2000 type 500 ... 2000
and press return. The range is inclusive of the values typed. Furthermore, if we want to filter on carat
0.32 ...
will show only diamonds with carat values larger than or equal to 0.32. Numeric variables also have a slider that you can use to define the range of values to keep.
If you want to get really fancy you can use the search box on the top right to search across all columns in the data using regular expressions. For example, to find all rows that have an entry in any column ending with the number 72 type 72$
(i.e., the $
sign is used to indicate the end of an entry). For all rows with entries that start with 60 use ^60
(i.e., the ^
is used to indicate the first character in an entry). Regular expressions are incredibly powerful for search but this is a big topic area. To learn more about regular expressions see this tutorial.
It is important to note that column sorting, column filters, and search are not persistent. To store these settings for use in other parts of Radiant press the Store
button. You can store the data and settings under a different dataset name by changing the value in the text input to the left of the Store
button. This feature can also be used to select a subset of variables to keep. Just select the ones you want to keep and press the Store
button. For more control over the variables you want to keep or remove and to specify their order in the dataset use the Data > Transform tab.
To download the data in csv format click the icon on the top right of your screen.
Click the report () icon on the bottom left of your screen or press ALT-enter
on your keyboard to add the filter and sort commands used by Radiant to a (reproducible) report in Report > Rmd.
For an overview of related R-functions used by Radiant to view, search, and filter data see Data > View