Chapter 14 Control structures
Control structures allow you to control the flow of execution of a script. They are extremely useful if you want to run a piece of code multiple times, or if you want to run a piece a code if a certain condition is met. Helpful control structures are conditional statements such as the if
, if-else
, and ifelse
statements, and loops such as the for
, while
and repeat
loops. For the R help documentation on these functions, see ?Control
.
14.1 Conditional statements
The if
, if-else
and ifelse
statements have the following syntax:
# if statement
if(condition) {
# do something
}
# if-else statement
if(condition) {
# do something
}else {
# do something else
}
# ifelse statement
ifelse(condition, statement1, statement2)
Condition
should be an expression that evaluates to TRUE
or FALSE
(i.e. a logical expression using relational and logical operators). If the expression returns TRUE
, then the script between the first set of curly brackets { }
will be executed for if
and if-else
, or statement1
will be executed for ifelse
. If the expression returns FALSE
, then either no code will be executed (for if
), or the else
script between the second set of curly brackets will be executed (for if-else
), or statement2
will be executed (for ifelse
). Multiple lines of code can be enclosed by the curly brackets, which are all executed as one block. Thus, the if
and if-else
statements are very suited for executing different (possibly complex) scripts given a condition, yet the ifelse
statement is very suited for simpler statements and are well suited for vectorization.
For example, an if-else
statement could be:
In this very simple example, when this script is executed when the variable x
has a value that is lower than 5, the conditional statement will print "x < 5"
to the screen, while when the variable x
has a value that is not lower than 5, the statement will print "x >= 5"
to the screen.
An example of vectorization with the ifelse
statement is:
14.2 Loops
A loop is a sequence of instructions that is repeated until a certain condition is been reached. for
, while
and repeat
, with the additional clauses break
and next
, are used to construct loops.
14.2.1 For-loops
A for
loop works on an iterator variable and assigns successive values till the end of a sequence, executing code for each iteration in the sequence, with the syntax:
The expression can contain multiple lines of code enclosed within curly brackets. For example, the following code:
for(i in 1:4) {
# some code, here just printing the value i to the console
print(i)
}
## [1] 1
## [1] 2
## [1] 3
## [1] 4
Note that a for
loop will always execute the code at least once when you use the sequence operator :
! Namely, any sequence will than have a length of at least 1, e.g. 1:1
or 0:0
. It can even lead to unwanted behaviour when not being careful, as is the case in the following code:
nrRepetitions <- 0
for(i in 1:nrRepetitions) {
# some code, here just printing the value i to the console
print(i)
}
## [1] 1
## [1] 0
Instead of repeating the code 0 times, this code actually executes the code twice, since the :
operator can and will count backwards!
Thus, always check whether the number of iterations in a for
loop is zero before you run your loop if you use :
operator. If you’re iterating over the indices of a vector, always use function seq_along()
in preference to the :
operator. For example, for a vector x
the function call seq_along(x)
generates a regular sequence of the same length of x
(including 0 length!):
14.2.2 while-loops
A while
loop executes some code as long as a condition is met, with the syntax:
where condition is a logical expression evaluating to TRUE
or FALSE
, and expression can be multiple lines of code. The while
loop will continue to execute as long as the condition evaluates as TRUE
, for example:
Be careful with while
loops: always make sure that the loop will end at some point!
14.2.3 Repeat-loops
A repeat
loop is used to iterate over a block of code multiple numbers of times. There is no condition check in repeat
loop to exit the loop. We thus have to put a condition check explicitly inside the body of the loop, and can use the break
statement to exit the loop. Failing to do so will result in an infinite loop! An example of a repeat
loop:
x <- 1
repeat {
if(x < 5) {
print(x)
x <- x + 1
}else {
break
}
}
## [1] 1
## [1] 2
## [1] 3
## [1] 4
Another statement that allows for the control of execution of code is the next
statement, which enables to skip the current iteration of a loop without terminating it. Note that loops can be nested within loops, which can be very helpful is some iteration code has to be iterated itself, for example:
for(i in 1:4) {
# print i to screen
cat("i =", i, ">> ") # the function cat() allows to print multiple items
for(j in 1:3) {
# print j to screen
cat(j, " ")
}
# print newline
cat("\n")
}
## i = 1 >> 1 2 3
## i = 2 >> 1 2 3
## i = 3 >> 1 2 3
## i = 4 >> 1 2 3
On the other hand, some loops are based on the onset and verification of a logical condition. The condition is tested at the start or the end of the loop construct. These variants belong to the while or repeat family of loops, respectively.
apply
, sapply
, mapply
, lapply
, rapply
, tapply
, vapply
(or the map family of functions from the purrr package). See
here and here for more information.