Algorithms are designed to solve problems. Over time, new algorithms are created to solve problems that old algorithms have already solved. In some cases, the new algorithms are not intrinsically better than the older ones. In other cases, these new algorithms breathe life into areas of research and engineering that could not exist without them. The question is: what makes an algorithm "better" than another one?
There is no single good answer to this question. If you write an algorithm to solve a problem, whatever you come up with will probably work just fine for small problems. That said, if you need to use the algorithm for a larger system, you might find yourself waiting longer and longer for your code to run. In these cases, you need an algorithm that scales well with system size. Ideally, an algorithm that works well for a large system will also work well for a smaller one; however, this is not always the case.
In order to determine the best algorithm for your system, it's often best to consult a tool computer scientists use to describe how algorithms scale with system size: complexity theory.
Here's the idea: algorithms operate on data.
Complexity theory uses different notations to describe how many operations an algorithm will need.
In this way, computational complexity measures runtime in terms of the number of operations an algorithm takes to complete its task.
To be clear, the notations used are not at all exact, but they roughly describe the run-time of code and can be used to estimate how long an algorithm should take to run.
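In addition, there are many different notations depending on who you ask, but for now we'll focus on the big 3: Big $$O$$, Big $$\Omega$$, and Big $$\Theta$$.
Big $$O$$ gives an upper bound on the number of operations and is usually quoted for the worst case, Big $$\Omega$$ gives a lower bound and is usually quoted for the best case, and Big $$\Theta$$ applies when the two bounds match.
It may seem strange that the same algorithm can take different amounts of time to run, so consider the following function: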

    function constant(a::UInt64, b::UInt64)
        println(b)
        # The upper bound is 2^64 - 1, so the loop runs 2^64 times
        for i = 0:18446744073709551615
            if a < b
                b = a - b    # unsigned subtraction wraps around
                println(b)
            end
        end
    end

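If we count the `println` statements, the best case is a single `println` when `a` is not less than `b`, because the condition inside the loop never holds. In the worst case, for example `a = 1` with a larger `b`, the unsigned subtraction wraps around, the condition holds on every iteration, and the function reaches $$2^{64} + 1$$ `println` statements.
So that's the explanation, and let's move on.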
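To make the best and worst cases concrete, here is a minimal sketch that counts the `println` calls instead of performing them. The name `count_prints` and the `iterations` parameter are assumptions introduced for illustration, and the loop is shortened so that the function finishes quickly.

```julia
# Hypothetical helper: counts how many times the example above would print
# if its loop ran `iterations` times instead of 2^64 times.
function count_prints(a::UInt64, b::UInt64, iterations::Int)
    prints = 1                  # the println(b) before the loop
    for i = 1:iterations
        if a < b
            b = a - b           # unsigned subtraction wraps around
            prints += 1         # the println(b) inside the loop
        end
    end
    return prints
end

best = count_prints(UInt64(5), UInt64(3), 1000)   # a > b: condition never holds
worst = count_prints(UInt64(1), UInt64(3), 1000)  # a < b: condition always holds
println("best case: ", best, ", worst case: ", worst)
```

With 1000 iterations, the best case counts a single print while the worst case counts one print per iteration plus the initial one.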
Of the three, Big $$O$$ is the one used most often, so it is the notation used in the examples that follow.
###### In the algorithms below, let us assume that the slowest statement is `println`, and that we are always counting every `println` in the function.
Let's write some code that reads in an array of length $$n$$ and runs in constant time:

    function constant(a::Array{Float64})
        println(a[1])
    end

Obviously, no matter how large `a` is, this function will not take any longer to run.
Because of this, we say it has a constant runtime and notate it with $$O(1)$$.
Now consider a slightly longer function:

    function constant(a::Array{Float64})
        if length(a) >= 3
            println(a[1])
            println(a[2])
            println(a[3])
        end
    end

This function has 3 print statements, so it has 3 operations total.
Because of this, it's tempting to say that the runtime would be $$O(3)$$; however, it is still written as $$O(1)$$.
Now, I know what you are thinking: "That's stupid! It's clear that the second function will take 3 times as long to run, so shouldn't we notate that?" You're not wrong; however, complexity notation is mostly interested in how algorithms scale with larger and larger inputs. Because we are talking about constant runtime, there is no scaling with larger inputs. No matter what array you pass to the above functions, they will always take a constant number of operations to finish. Whether that constant is 1 operation or 3 operations doesn't really matter because different machines will have different runtimes anyway.
Now, here's the thing: as we move on to more complicated examples, we will continue to ignore constants and extra terms to make the notation easier to understand. Just because this is common practice does not mean it's the best practice. I have run into several situations where knowing the constants has saved me hours of runtime, so keep in mind that all of these notations are somewhat vague and dependent on a number of auxiliary factors. Still, that doesn't mean the notation is completely useless. For now, let's keep moving forward with some more complicated (and useful) examples!
Now we are moving into interesting territory! Let's consider the following function:

    function linear(a::Array{Float64})
        for i = 1:length(a)
            println(a[i])
        end
    end

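Here, it's clear that if we increase `a` by one element, we will need to do another operation.
That is, with an array of size $$n$$, we will need $$n$$ operations, so the runtime is $$O(n)$$.
As before, doing more work inside the `for` loop will change the constant in front of $$n$$, but not the notation. Consider the following function: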

    function linear(a::Array{Float64})
        println("The first element in our array is: ", a[1])
        println("The sum of all pairs of elements in our array are...")
        # Integer division keeps the loop bound an integer index
        for i = 1:div(length(a), 2)
            println("a is: ", a[2*i-1])
            println("b is: ", a[2*i])
            println("The sum of a and b is: ", a[2*i-1] + a[2*i])
        end
        println("The last element in our array is: ", a[end])
    end

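Technically, this function makes roughly $$\frac{3n}{2} + 3$$ `println` calls, but after dropping constants it still has a complexity of $$O(n)$$ because of its single `for` loop, which is pretty good!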
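As a rough check on the linear scaling, the sketch below tallies the `println` calls of the pair-summing example above without printing anything. The name `count_linear_prints` is an assumption introduced for illustration.

```julia
# Hypothetical counter mirroring the pair-summing example: it tallies the
# println calls for an array of length n without printing anything.
function count_linear_prints(n::Int)
    prints = 2                  # the two println calls before the loop
    for i = 1:div(n, 2)
        prints += 3             # three println calls per pair
    end
    prints += 1                 # the println call after the loop
    return prints
end

for n in (10, 100, 1000)
    println("n = ", n, " -> ", count_linear_prints(n), " prints")
end
```

Multiplying `n` by 10 multiplies the count by roughly 10, which is what linear scaling means in practice; the constants only shift the totals.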
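A promise of $$O(n)$$ runtime is not always possible, though. Consider a function that visits every pixel of a square image: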

    # Here, size is the length of a single side of the image
    function access_image(img::Array{Float64}, size::Int64)
        for i = 1:size
            for j = 1:size
                # Julia arrays are 1-indexed, so row i starts at (i-1)*size
                index = j + (i-1)*size
                println(img[index])
            end
        end
    end

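This is a simple case where a nested `for` loop is perfectly acceptable, and it's obvious geometrically that we need to access $$n^2$$ elements of an image whose side has length $$n$$.
This gives a polynomial runtime of $$O(n^2)$$, which is typical of nested `for` loops.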
That said, there have been several cases throughout the history of algorithms where polynomial runtimes have kept certain algorithms from being used at all, simply because they take too long to run!
For this reason, if you can avoid writing nested `for` loops, you certainly should!
However, there are several cases where this cannot be avoided, so don't spend too much time worrying about it unless runtime becomes an issue!
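To see how quickly nested loops grow, here is a minimal sketch that counts the element accesses of two nested loops over a square image. The name `count_image_accesses` and the `side` parameter are assumptions introduced for illustration.

```julia
# Hypothetical counter: number of elements visited by two nested loops
# over a square image with `side` pixels per side.
function count_image_accesses(side::Int)
    accesses = 0
    for i = 1:side
        for j = 1:side
            accesses += 1
        end
    end
    return accesses
end

for side in (10, 20, 40)
    println("side = ", side, " -> ", count_image_accesses(side), " accesses")
end
```

Doubling the side length quadruples the number of accesses, which is the signature of quadratic scaling.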
Exponential and logarithmic runtimes are two more cases that come up all the time and often share a common theme: recursion. Generally speaking, logarithmic algorithms are some of the fastest algorithms out there, while exponential algorithms are some of the slowest. Unfortunately, this means that recursion can be either the most useful tool in existence for realizing certain algorithms or the most harmful one, depending on your problem.
Here is a simple example of a function with exponential runtime:

    # Here, n is the number of iterations
    function exponential(value::Int64, n::Int64)
        println(value)
        if n >= 0
            value += 1
            exponential(value, n-1)
            exponential(value, n-1)
        end
    end

Here, we read in the maximum number `n` we are iterating through and recursively call the `exponential` function, decrementing the number of iterations left each time.
Because each call makes two further calls to the `exponential` function, this has a complexity of $$O(2^n)$$.
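The doubling can be checked with a small sketch that counts the `println` calls of the `exponential` function above instead of performing them. The name `count_exponential_prints` is an assumption introduced for illustration.

```julia
# Hypothetical counter: returns how many times the exponential example
# would call println for a given n, without printing anything.
function count_exponential_prints(n::Int)
    prints = 1                                    # the println(value) in this call
    if n >= 0
        prints += count_exponential_prints(n - 1) # first recursive call
        prints += count_exponential_prints(n - 1) # second recursive call
    end
    return prints
end

for n in 0:5
    println("n = ", n, " -> ", count_exponential_prints(n), " prints")
end
```

Each increase of `n` by one roughly doubles the count, so even modest values of `n` become expensive very quickly.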
Logarithmic algorithms can be thought of as the opposite of exponential ones. Instead of taking a value and computing more and more values each time, a good example of a logarithmic algorithm is one that takes an array and recursively divides it up, like so:
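
    # Here, cutoff is an arbitrary variable to know when to stop recursing
    # (it should be at least 1, or the recursion never ends)
    function logarithmic(a::Array{Float64}, cutoff::Int64)
        if length(a) > cutoff
            # Integer division keeps the slice bounds valid indices
            logarithmic(a[div(length(a), 2)+1:end], cutoff)
        end
        println(length(a))
    end
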
To be honest, it is not obvious that the provided `logarithmic` function should operate in $$O(\log(n))$$ time.
That said, I encourage you to think about an array of size 8.
First, we split it in half and run the algorithm on one of the halves, which is an array of 4 elements.
If we split that array and run the algorithm on one half, we have an array of 2 elements, and if we split once more, we are left with an array of a single element.
This is as far as we can go, and we ended up dividing the array 3 times to get to this point.
Since $$3 = \log_2(8)$$, the number of divisions grows as the logarithm of the array size.
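The same halving argument can be sketched for other sizes. The helper below, whose name `count_halvings` is an assumption introduced for illustration, counts how many times a length can be cut in half before it reaches the cutoff, keeping the second half each time as the example does.

```julia
# Hypothetical counter: how many times a length can be halved
# before it is no longer greater than `cutoff` (assumed to be at least 1).
function count_halvings(len::Int, cutoff::Int)
    halvings = 0
    while len > cutoff
        len -= div(len, 2)      # keep the second half of the array
        halvings += 1
    end
    return halvings
end

for len in (8, 1024, 1_000_000)
    println("length = ", len, " -> ", count_halvings(len, 1), " halvings")
end
```

Going from 8 elements to a million only raises the count from 3 to 20, which is why logarithmic algorithms scale so well.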
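We've outlined the most common complexity cases of different algorithms here, but at this point things might still be unclear.
Which is better: a constant, logarithmic, linear, polynomial, or exponential runtime?
[Figure: "Complexity Scaling"]
Here, we see each of the complexity cases as $$n$$ increases.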
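The comparison can also be tabulated numerically. The sketch below prints the number of operations each complexity case suggests for a few input sizes, with constants ignored. The `cases` list and its labels are assumptions made for illustration.

```julia
# Hypothetical table: the number of operations each complexity case
# suggests for a few input sizes n (constants ignored).
cases = [
    ("O(1)",     n -> 1.0),
    ("O(log n)", n -> log2(n)),
    ("O(n)",     n -> float(n)),
    ("O(n^2)",   n -> float(n)^2),
    ("O(2^n)",   n -> 2.0^n),
]

for n in (8, 16, 32)
    println("n = ", n)
    for (name, f) in cases
        println("  ", rpad(name, 10), f(n))
    end
end
```

At `n = 32`, the exponential case already exceeds four billion operations while the logarithmic case is only 5, so the ordering from best to worst is constant, logarithmic, linear, polynomial, exponential.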
Now, there is a lot more to say about computational complexity and we'll definitely cover it at some point, but I can only move so fast!
In particular, I would love to have a discussion on the
This is a book about algorithms.
It would be nearly impossible to talk about most algorithms without touching on complexity theory and explaining why certain algorithms are faster than others.
That said, just because an algorithm has a better complexity on paper does not mean it will run faster on your particular problem.
Basically, take complexity notation with a grain of salt. It is a useful descriptor of how fast algorithms should run in an ideal world; however, ideal worlds do not exist. When it comes to programming, there may be hundreds of other factors that need to be considered before implementing anything. That said, complexity notation should not be ignored. If you can easily implement an algorithm that is notationally faster with no repercussions, go for it! Just be sure you do not waste time trying to optimize code you haven't written yet.
In general, my advice would be the following: write code first and optimize what you can on the first run-through without going too far out of your way. If the runtime is awful, go back and see about implementing algorithms that are faster based on complexity notation.
The code examples are licensed under the MIT license (found in LICENSE.md).
The text of this chapter was written by James Schloss and is licensed under the Creative Commons Attribution-ShareAlike 4.0 International License.
- The image "Complexity Scaling" was created by James Schloss and is licensed under the Creative Commons Attribution-ShareAlike 4.0 International License.
After initial licensing (#560), the following pull requests have modified the text or graphics of this chapter:
- none