Programming for fun and profit

Book Review: Cybersecurity Myths and Misconceptions

2024-07-05T00:00:00+02:00

The world of cybersecurity is full of perils. Many of those perils stem from misconceptions about the nature of cybersecurity. If you come to the insight that perfect security is unachievable, you may reach the conclusion that being concerned with cybersecurity at all is a waste of your time. After all, the bad guys will get in anyway. Or you may have the notion that your particular company is too small of a target for anyone to bother with, and so you don't bother to defend yourself appropriately. Or perhaps you experience a security breach and take action before you even know what's going on, because surely, some action is better than no action. If reading about 175 statements of this kind and why they're flawed, then Cybersecurity Myths and Misconceptions: Avoiding the Hazards and Pitfalls that Derail Us may be the book for you.

Cybersecurity Myths and Misconceptions: Avoiding the Hazards and Pitfalls that Derail Us
by Eugene Spafford, Josiah Dykstra, Leigh Metcalf
Released January 2023
Publisher(s): Pearson Education (US)
ISBN: 9780137929238

The book in a nutshell

The book covers 175 myths and misconceptions in cybersecurity. These are divided into 16 chapters covering different topics ranging from faulty assumptions (e.g. "I am too small to be a target"¹) to misconceptions about digital forensics (e.g. "Incidents are discovered as soon as they occur"²).

The vast majority of the myths are presented in the following format.

Present some background information for the topic to make sense.
Make a bold statement (the myth or misconception)
Dismantle the statement and show how it does not hold water
- Although some myths are acknowledge to be partially true, given the right circumstances

In some cases, there's also a small fictitious case study of a manager wanting to apply some technology or rule motivated by the myth. These can be a bit on the nose, but also occasionally hit too close to home to be comfortable.

For the most part, this is not a technical book. Most myths are presented on high enough a level that you do not need to be a software engineer to appreciate it. There are some things that may fly over your head if you're not a software engineer, such as talk about the OSI network model and interprocess communication, but such technical dives are so infrequent that they barely need mention.

What I liked

This book is a leisure read. It does not require you to sit at a computer and apply concepts to benefit from it, all you need to do is read the book and ponder its implications. At the same time, it will surely grant you insights that will have tangible effects on your ability to perform in a technical role.

As a concrete example, there is a section about cognitive biases. One such bias is so-called action bias, where you are prone to take action as a response to a cybersecurity threat, even before you have understood the situation well enough to make a qualified decision about what action to take. In reality, sometimes no action is the best solution, and rather often I'd say that no action is better than the wrong action. While I have experience enough to know not to act prematurely, and sometimes not to act at all, I am definitely predisposed with an action bias. As, I think, are a lot of software engineers. We want to fix things, not just wait for them to resolve themselves. Reading about action bias in the way it is framed in this book has caused me to reflect on how I act, especially in incident response, and I think it will help me take an even more measured approach in the future.

In addition to expanding my own mind, I also found that the book framed a whole lot of the complaints that I've had to managers and peers over the years. For example, the notion that "I'm to small to be a target" is something I've seen a lot, and the next time I come across it I very well may use this book as a reference to attempt to dissuade it.

A lot of what is covered in the book is also applicable to software engineering as a whole (or even business as a whole), rather than just cybersecurity. The sunk cost fallacy³ is universally applicable when it comes to any costly investment, as is the fact that basing decisions on anecdotal evidence isn't necessarily the best idea. There's also this really insightful chapter on how analogies and abstractions can be damaging to understanding when they don't fit well enough. For example, the name firewall comes from physical walls intended to impede the spread of fire. A firewall in computing is built to stop some traffic ("fire"), but also let some traffic through. The latter does not fit the analogy, a physical firewall shouldn't let any fire through.

As a final note on what I liked, there is a running theme in the book that insulting people for their lack of cybersecurity awareness serves no purpose, and that guiding others to better decisions rather than berating bad ones is often the most effective path. However, there is also an undertone that at least some cybersecurity issues stem from software engineers with insufficient skills. When discussing cross-platform compatibility in high-level programming languages, one footnote reads as follows.

We are glossing over many things such as numerical precision, interprocess communication, and character sets. For this discussion, assume those things do not matter, although they certainly do in real life. If you do not know what we mean by that and you are writing production software, then you need to study more software engineering.

This makes it clear that even though the authors believe in helping and educating your peers to the best of your abilities, a base level of understanding is to be expected from your peers. And that understanding is, at least in part, their responsibility to attain.

What I didn't like

There's a large amount of footnotes throughout the book, a lot of which contain useful and interesting snippets of information as well as sources for statements. These, I do not mind at all. But, unfortunately, there is a bit of an oversaturation of footnotes that only contain jokes, puns and witty remarks. While they are all well written and give the book a lighter tone, there are at times simply too many of them.

Don't get me wrong, I don't mind light-hearted writing, and a few of these kinds of footnotes would only have made the book more enjoyable to read. But when you interrupt your reading flow to read a footnote only to be greeted by your third witty remark in a row, it does get a bit annoying.

Conclusions

This book widened my view of software engineering and cybersecurity. While most of the topics were familiar to me from my education and professional experience, some were forgotten, some brand new, and many were simply framed in a way I had previously not considered them in. Given the rather limited amount of effort I put into reading the book, I got a whole lot of value out of it.

If you're a practicing software engineer, you should read this book, especially if you feel like your grasp of cybersecurity topics isn't where it should be. It won't teach you what you need to know on a technical level, but it will help build up the basic mindset you ought to adopt as well as illuminate a lot of what you don't yet know.

If you're a technical manager in charge of software engineers, you should read this book. It will give you better insight into why software engineers sometimes say "no" and hopefully allow you to make more measured decisions when it comes to acquiring new or retaining existing software.

You are not. A lot of cyber mischief isn't even targeted, but rather automated processes scanning vast amounts of the Internet to find servers with known vulnerabilities. ↩
They are not. In 2022, the average time to discovery of a breach was 207 days. ↩
The idea that an existing investment in something warrants further investment into the same thing (see Wikipedia) ↩

PostgreSQL indexing: The basics

2024-04-02T00:00:00+02:00

Back when I was in university, I had a teacher in database systems who also did contract work as a database administrator. He told me that 90% of his work was to just add and reorganize indexes to speed up queries. That was almost 10 years ago, and I'm now inclined to believe him. Having worked on three different systems backed by PostgreSQL databases over the past four years, I've seen my fair share of queries missing indexes. The fallout has ranged from a poor end user experience due to slow responses to entire systems becoming unresponsive as databases grind to a halt under the load of unindexed queries.

Needless to say, indexing is important. I would go as far as to say that if you don't understand the basics of indexing, you shouldn't be writing queries against a relational database in a production system. In this article, I'll walk you through the basics of indexing in PostgreSQL, including how an index actually works. While this is targeted at PostgreSQL and contains a lot of specifics about it, the general principles of indexing presented here are applicable to relational databases in general.

Let's get started.

Note: This article is based on PostgreSQL 16, but the vast majority of the content should be accurate for PostgreSQL 9.6 and newer.

The incredible impact of indexes

To sell you on the fact that indexes are important, let's do a quick experiment. Below I define a simple data model and fill it with some sample data.

CREATE TABLE IF NOT EXISTS data (
  id SERIAL PRIMARY KEY,
  value INT NOT NULL
);

-- seed with 10 million rows where 0 <= value <= 1,000,000
INSERT INTO data (value)
SELECT (RANDOM() * 1000000)::INT
FROM generate_series(1, 10000000);

This gives us 10 million rows, where the value is between 0 and 1 million, thus on average giving us 10 duplicates of each value. To be very clear, the data looks like this.

Note: test=# is my psql prompt. I'm using psql throughout this article to execute queries.

test=# SELECT * FROM data LIMIT 5;
 id | value  
----+--------
  1 | 602761
  2 | 744515
  3 | 725410
  4 | 729718
  5 | 783837
(5 rows)

Let's execute two queries, one where we search for a particular id, and one where we search for a particular value.

test=# \timing
Timing is on.
test=# SELECT * FROM data WHERE id = 560487;
   id   | value  
--------+--------
 560487 | 650515
(1 row)

Time: 0.845 ms

test=# SELECT * FROM data WHERE value = 560487;
   id    | value  
---------+--------
 2068259 | 560487
 5013963 | 560487
 6894022 | 560487
 4305566 | 560487
 8455242 | 560487
 9020004 | 560487
 7313065 | 560487
(7 rows)

Time: 254.458 ms

Searching for the id took less than a millisecond, while searching for the value took a quarter of a second. While you could argue that the database had to do more work in compiling the results for the value search as there were more hits, that shouldn't (and in fact, does not) account for a more than 250x timing difference.

So what's up? Let's have a look at the query plans to find out.

test=# EXPLAIN ANALYZE SELECT * FROM data WHERE id = 560487;
                                                   QUERY PLAN                                                   
----------------------------------------------------------------------------------------------------------------
 Index Scan using data_pkey on data  (cost=0.43..8.45 rows=1 width=8) (actual time=0.016..0.017 rows=1 loops=1)
   Index Cond: (id = 560487)
 Planning Time: 0.058 ms
 Execution Time: 0.030 ms
(4 rows)

test=# EXPLAIN ANALYZE SELECT * FROM data WHERE value = 560487;
                                                     QUERY PLAN                                                      
---------------------------------------------------------------------------------------------------------------------
 Gather  (cost=1000.00..97332.43 rows=11 width=8) (actual time=74.701..257.131 rows=7 loops=1)
   Workers Planned: 2
   Workers Launched: 2
   ->  Parallel Seq Scan on data  (cost=0.00..96331.33 rows=5 width=8) (actual time=114.550..231.494 rows=2 loops=3)
         Filter: (value = 560487)
         Rows Removed by Filter: 3333331
 Planning Time: 0.046 ms
 Execution Time: 257.164 ms
(8 rows)

Going deep into query plans is beyond the scope of this article, but we don't need to go deep here. We only need to see that the id query does an index scan, while the value query does a seq(uential) scan. The former means that the sought value is looked up in an index, while the latter means that the entire table is scanned to find the value we're looking for. We can even see that the query planner¹ decided to do the sequential scan in parallel with two workers, indicating that it's a rather heavy query.

Why was id indexed and not value, you ask? Because the primary key in a PostgreSQL table is always indexed, but that's also the only index you get for free. Let's now add an index to value as well and see if we can make it a bit faster.

test=# CREATE INDEX idx_data_value ON data (value);
CREATE INDEX
test=# SELECT * FROM data WHERE value = 560487;
   id    | value  
---------+--------
 2068259 | 560487
 4305566 | 560487
 5013963 | 560487
 6894022 | 560487
 7313065 | 560487
 8455242 | 560487
 9020004 | 560487
(7 rows)

Time: 1.058 ms

Look at that, as fast as searching for the id! If we have a look at the query plan, we can see that we now do hit an index.

test=# EXPLAIN ANALYZE SELECT * FROM data WHERE value = 560487;
                                                       QUERY PLAN                                                       
------------------------------------------------------------------------------------------------------------------------
 Bitmap Heap Scan on data  (cost=4.52..48.14 rows=11 width=8) (actual time=0.050..0.072 rows=7 loops=1)
   Recheck Cond: (value = 560487)
   Heap Blocks: exact=7
   ->  Bitmap Index Scan on idx_data_value  (cost=0.00..4.52 rows=11 width=0) (actual time=0.034..0.035 rows=7 loops=1)
         Index Cond: (value = 560487)
 Planning Time: 0.189 ms
 Execution Time: 0.122 ms
(7 rows)

This looks a bit different with bitmap this and bitmap that. That's because the query planner decided that it was going to read more data, and thus opted for a slightly different strategy. We're still making good use of the index, though, and if we simply put a limit on the amount of returned values we'll see a pure index scan just like we did with the primary key.

test=# EXPLAIN ANALYZE SELECT * FROM data WHERE value = 560487 LIMIT 1;
                                                         QUERY PLAN                                                          
-----------------------------------------------------------------------------------------------------------------------------
 Limit  (cost=0.43..4.82 rows=1 width=8) (actual time=0.041..0.042 rows=1 loops=1)
   ->  Index Scan using idx_data_value on data  (cost=0.43..48.63 rows=11 width=8) (actual time=0.038..0.039 rows=1 loops=1)
         Index Cond: (value = 560487)
 Planning Time: 0.169 ms
 Execution Time: 0.077 ms
(5 rows)

If you only wanted to be sold on the fact that indexing is important, I think that's been covered already. But just having the effect of something without understanding at least the principles on which it operates will still make it harder than it has to be to apply it in practice. So let's discuss the principles of indexing and work through an example of the most commonly used index type: the B-tree.

Indexing in theory

If you've ever read a non-fiction book, you know in principle what an index is. Because at the back of that book there's a section called "index", and it contains an alphabetically sorted list of terms and phrases used throughout the book and a reference to the page number(s) where you can find their use. It's faster to find where "the rule of three" is defined in the book by looking it up in the index than by scanning through the entire book. Database indexes operate on precisely that assumption; it's faster to look a value up in an index than it is to scan through the entire table.

There are many different kinds of indexes in databases, but by far the most common type is the B-tree index. Inspecting the indexes of the 'datatable, we can see that they are both of typebtree`.

test=# \d data
                            Table "public.data"
 Column |  Type   | Collation | Nullable |             Default              
--------+---------+-----------+----------+----------------------------------
 id     | integer |           | not null | nextval('data_id_seq'::regclass)
 value  | integer |           | not null | 
Indexes:
    "data_pkey" PRIMARY KEY, btree (id)
    "idx_data_value" btree (value)

You may immediately think "oh, a binary tree", which is close, but no cigar. In the following section I'll outline how a B-tree index works and its properties such that you can make effective use of it in your day to day work.

The B-tree

A B-tree is indeed a search tree, and it is in fact rather similar to a binary search tree. Each node, or page² is a sorted list of references either to other pages of the tree, or to columns of the indexed table³

Let's work through the example of searching for value=560487 given the index structure shown in the image below. We start from the root page and search ⁴ through it until we find the segment where 500000 <= value. Semantically, this segment has an exclusive upper limit equal to the inclusive lower limit of the next segment (i.e. 500000 <= value < 600000 in this example). Following the reference to the start of the next page, we search that until we find the segment where 560000 <= value < 561000. This segment refers to a leaf page, which again we search through until we find the desired key that in turn contains references to the sought rows of the table. This search path is reflected in the below image with black lines, whereas the faint gray lines show existing references that are not traversed.

Note that this is just for illustrative purposes. The splitting into segments as shown here is completely made up by me to be easy to illustrate. It's very unlikely to actually be a good split and therefore not the one that PostgreSQL would actually make.

In addition to providing quick lookup to specific values, the fact that B-trees are sorted leads to some possibly unexpected benefits in queries that require sorted output.

Selecting ranges is almost as fast as single values

Did you notice in the illustration above that the leaf page contained not only the value we searched for, but also its closest neighbors? Because of this adjacency, an index lookup for a range of values is incredibly efficient⁵.

test=# SELECT * FROM data WHERE 560486 <= value AND value < 560489;
   id    | value  
---------+--------
 1061367 | 560488
 1451289 | 560488
 2068259 | 560487
 2250572 | 560488
 2298922 | 560486
 2998709 | 560486
 3149734 | 560486
 3385911 | 560486
 3552143 | 560488
 4123068 | 560488
 4305566 | 560487
 4599351 | 560488
 5013963 | 560487
 5490314 | 560488
 5521774 | 560488
 5715474 | 560486
 5725443 | 560488
 6125940 | 560486
 6423931 | 560486
 6752395 | 560488
 6894022 | 560487
 7313065 | 560487
 7365878 | 560488
 7956607 | 560488
 8046840 | 560486
 8290267 | 560488
 8455242 | 560487
 8941102 | 560488
 9020004 | 560487
 9663961 | 560488
(30 rows)

Time: 1.360 ms

A very common use case for a range selection is on creation date, e.g. `WHERE created_at >= '2024-04-01'.

Indexes greatly speed up `ORDER BY`

Tacking on an ORDER BY to a query is commonplace and pretty much required if you plan to do any kind of pagination on the result set⁶. Without an index on value, ordering by it takes quite a bit of time.

test=# DROP INDEX IF EXISTS idx_data_value;
DROP INDEX
test=# SELECT * FROM DATA ORDER BY value LIMIT 10;
   id    | value 
---------+-------
 1453865 |     0
 8150282 |     0
 7796750 |     0
 3782708 |     0
 8571152 |     1
 1650656 |     1
 9165134 |     1
  268472 |     1
 5889252 |     1
 3330397 |     1
(10 rows)

Time: 331.239 ms

But tacking on the index, it's again orders of magnitude faster.

test=# CREATE INDEX idx_data_value ON data (value);
CREATE INDEX
test=# SELECT * FROM DATA ORDER BY value LIMIT 10;
   id    | value 
---------+-------
 1453865 |     0
 3782708 |     0
 7796750 |     0
 8150282 |     0
  268472 |     1
 1650656 |     1
 2191333 |     1
 3330397 |     1
 5364764 |     1
 5889252 |     1
(10 rows)

Time: 1.628 ms

This is so fast because there's actually no ordering going on; PostgreSQL just scans the already sorted index.

test=# EXPLAIN ANALYZE SELECT * FROM DATA ORDER BY value LIMIT 10;
                                                               QUERY PLAN                                                               
----------------------------------------------------------------------------------------------------------------------------------------
 Limit  (cost=0.43..0.81 rows=10 width=8) (actual time=0.049..0.074 rows=10 loops=1)
   ->  Index Scan using idx_data_value on data  (cost=0.43..372326.50 rows=10000000 width=8) (actual time=0.046..0.069 rows=10 loops=1)
 Planning Time: 0.161 ms
 Execution Time: 0.104 ms
(4 rows)

Time: 1.284 ms

There is simply a lot of value to having an index that is already ordered⁷ as so many common operations rely on ordering.

Indexing pitfalls

Thus far, this article has mostly covered the happy path of indexing, when everything just works out. It may have given you the impression that indexing is a silver bullet to any query running a bit slow. Unfortunately, that's far from the case, so in this section I will outline a few scenarios that may cause confusion among up and coming indexers.

The query planner can choose not to use an index

Indexes are great, but sometimes the query planner may choose to ignore them and just scan the table instead. This is quite easy to show, take for example the following query where we search for any value > 500000.

test=# EXPLAIN ANALYZE SELECT * FROM data WHERE value > 500000;
                                                   QUERY PLAN                                                    
-----------------------------------------------------------------------------------------------------------------
 Seq Scan on data  (cost=0.00..169248.00 rows=5044664 width=8) (actual time=5.213..653.177 rows=4998152 loops=1)
   Filter: (value > 500000)
   Rows Removed by Filter: 5001848
 Planning Time: 0.151 ms
 JIT:
   Functions: 2
   Options: Inlining false, Optimization false, Expressions true, Deforming true
   Timing: Generation 0.379 ms, Inlining 0.000 ms, Optimization 0.356 ms, Emission 3.469 ms, Total 4.204 ms
 Execution Time: 770.366 ms
(9 rows)

The query planner chooses to forego the index and just scan the table (Seq Scan on data). Why? Recall again the example trace of the B-tree index in the first section of this article; it requires resolving a whole bunch of references and accessing data both from the index and the table⁸, which are stored separately. When accessing large parts of the table, it's often faster to just scan through the entire table than it is to resolve all of those references. The condition value > 500000 corresponds to roughly half of the table⁹, so the query planner does not use the index.

If we increase the value, the query planner will re-evaluate and use the index as it estimates that a small enough part of the table needs to be scanned in the end.

 Bitmap Heap Scan on data  (cost=49658.83..144439.73 rows=4042632 width=8) (actual time=196.822..663.746 rows=4000401 loops=1)
   Recheck Cond: (value > 600000)
   Heap Blocks: exact=44248
   ->  Bitmap Index Scan on idx_data_value  (cost=0.00..48648.17 rows=4042632 width=0) (actual time=189.479..189.479 rows=4000401 loops=1)
         Index Cond: (value > 600000)
 Planning Time: 0.157 ms
 JIT:
   Functions: 2
   Options: Inlining false, Optimization false, Expressions true, Deforming true
   Timing: Generation 0.367 ms, Inlining 0.000 ms, Optimization 0.000 ms, Emission 0.000 ms, Total 0.367 ms
 Execution Time: 756.080 ms
(11 rows)

Now the index is used, but the query runs almost precisely as fast at 756 ms, compared to 770 ms for the full table scan. Considering it's also fetching fewer rows due to a narrower search, the fact that the query planner chose to run the previous plan without touching the index seems justified.

Note that this is all circumstantial, and that's entirely the point. The query planner will come up with different query plans depending on the layout of the data and the exact parameters of the query, even with all other server and client settings being the same. A consequence of this is query plans can change dramatically as data in your tables accumulate. This also means that the query plans you get in your local development environment are often different from the ones you get in production. As such, one can easily be fooled by a query that gets a terrible looking plan in your local environment but actually runs fine in production, and vice versa¹⁰. Be aware of this when you add new indexes and are perplexed as to why they are not used; there may just not be enough data for it to be worth it.

An index is for an exact expression

We have an index on data(value) and showed that ordering by value is super quick. Let's do it again so you don't have to scroll up too far.

test=# SELECT * FROM data ORDER BY value LIMIT 10;
   id    | value 
---------+-------
 1453865 |     0
 3782708 |     0
 7796750 |     0
 8150282 |     0
  268472 |     1
 1650656 |     1
 2191333 |     1
 3330397 |     1
 5364764 |     1
 5889252 |     1
(10 rows)

Time: 0.455 ms

But what if we order by value * 2?

test=# SELECT * FROM data ORDER BY value * 2 LIMIT 10;
   id    | value 
---------+-------
 3782708 |     0
 1453865 |     0
 7796750 |     0
 8150282 |     0
 5889252 |     1
 3330397 |     1
 9165134 |     1
 5364764 |     1
 8571152 |     1
  268472 |     1
(10 rows)

Time: 540.665 ms

It takes 1000 times as long. Looking at the query plan, we can see why.

test=# EXPLAIN ANALYZE SELECT * FROM data ORDER BY value * 2 LIMIT 10;
                                                                 QUERY PLAN                                                                 
--------------------------------------------------------------------------------------------------------------------------------------------
 Limit  (cost=187371.53..187372.70 rows=10 width=12) (actual time=691.290..696.029 rows=10 loops=1)
   ->  Gather Merge  (cost=187371.53..1159661.72 rows=8333334 width=12) (actual time=682.579..687.316 rows=10 loops=1)
         Workers Planned: 2
         Workers Launched: 2
         ->  Sort  (cost=186371.51..196788.18 rows=4166667 width=12) (actual time=659.729..659.730 rows=7 loops=3)
               Sort Key: ((value * 2))
               Sort Method: top-N heapsort  Memory: 25kB
               Worker 0:  Sort Method: top-N heapsort  Memory: 25kB
               Worker 1:  Sort Method: top-N heapsort  Memory: 25kB
               ->  Parallel Seq Scan on data  (cost=0.00..96331.33 rows=4166667 width=12) (actual time=1.799..312.822 rows=3333333 loops=3)
 Planning Time: 0.250 ms
 JIT:
   Functions: 7
   Options: Inlining false, Optimization false, Expressions true, Deforming true
   Timing: Generation 1.026 ms, Inlining 0.000 ms, Optimization 0.738 ms, Emission 13.207 ms, Total 14.971 ms
 Execution Time: 696.794 ms
(16 rows)

Again, there is no need for a deep understanding of query plans to see that this is worse than the simple index scan shown before. Here we've both got sequential scanning of the entire table and a bunch of sorting going on, ending up taking a whole lot of time. The reason the query plan looks like that is that value * 2 is not indexed, only value is. It's the exact expression used in the index creation that is actually indexed and can be used in WHERE, ORDER BY, GROUP BY etc. This should be fairly evident given how a B-tree index lookup works; we cannot quickly lookup an arbitrary expression using a column based on an index of the raw column.

This example is of course silly, because ordering by a value or ordering by double that value results in the same order. In a more realistic scenario, there are many good reasons to use expressions in ordering or searching. For example, if you want to perform a case insensitive search on a text column that contains mixed casing, calling LOWER() on that column makes a whole lot of sense. Such searches can be optimized using an expression index (a.k.a functional index), but that is something I plan to cover in an upcoming article. For the purposes of this article, I only want to make it very clear that a column having an index on it does not imply that any search on that column can utilize said index.

Indexes optimize reads but slow down writes

Something to keep in mind with indexing is that an index is made to speed up reads. On the flip side, they slow down writes as every time a table is written to its indexes must also be written to¹¹

That said, you probably don't need to worry much about indexes taking up space or slowing down writes. I have personally only had to think about such things with indexes for very large tables with billions of rows, or tables with very high throughput demands. The vast majority of applications have neither of those, while the lack of indexes becomes problematic even for small amounts of data. In other words, the benefits of indexing almost always outweighs the cons, but it's good to be aware that cons exist.

Summary

In this article, we've covered the basics of indexing in PostgreSQL, had a theoretical look at a B-tree index and its properties as well as how these can be taken advantage of. In addition, we've looked at some common pitfalls with indexing and some things to keep in mind. That may seem like a lot more than just basics, but the unfortunate reality is that indexing is a complex subject. We haven't delved into other types of indexes or multicolumn indexes. We also haven't touched upon partial indexes or how indexes interact with more complex queries, such as joins. Simply put, there is a lot more to cover.

I plan to continue this article series with shorter articles that cover specific use cases where these more advanced indexing techniques are useful. If you're eager to learn more before I get around to that, the PostgreSQL index documentation is a great place to start!

The query planner (or optimizer) is the part of PostgreSQL that's responsible for taking your high-level SQL query and figuring out the fastest way to execute it in the database (source). ↩
In PostgreSQL, the nodes of a B-tree are called pages (source) ↩
Pages in the B-tree are either internal pages or leaf pages. An internal page only contains references to other B-tree pages, while a leaf page only contains references to the indexed table. There are no hybrid internal/leaf pages. The root can be either internal or a leaf depending on the size of the index. There is also a special metapage that keeps track of things like tree depth and which page is the root of the tree (source). A search in a B-tree index starts from the singular root page and then proceeds through internal pages in the tree until a leaf page containing references to the indexed table is found. From there, the references to the table can be resolved and the rows retrieved. ↩
The page is sorted, PostgreSQL makes effective use of binary search to quickly find a key ↩
Depending on the amount of hits, scanning the rows from the table may of course take longer, however. ↩
The order in which rows are returned from a query is undefined unless ORDER BY is provided. A common bug in backend systems is that OFFSET and LIMIT are used for pagination either without explicit ordering or with ordering that is not unique. ↩
Note that only the B-tree type of index has this property (source). ↩
Except for so-called index-only scans (source) ↩
The query planner knows this as PostgreSQL keeps statistics on table contents (source). ↩
The query planner can also come up with terrible queries if table statistics are too outdated, but that's a topic for another time. ↩
Except for heap-only tuples (source). Indexes can also take up a significant amount of storage. I once worked with a production database where the largest table was around 2 terabytes in size, with an additional 3 terabytes of indexes. Adding a new index was something we thought long and hard about before actually doing. ↩

Configuring touchpad tap in Sway

2024-03-24T00:00:00+01:00

One thing that didn't carry over from my i3 setup when moving over to Sway (i.e. Wayland) was the touchpad configuration. Input devices was a display server concern under X.Org, so the window manager had nothing to do with it. Notably, now click-on-tap is not working for me, which is massively annoying.

If you're just here for the touchpad click-on-tap configuration, it looks like this.

input "type:touchpad" {
    dwt enabled
    dwtp enabled
    tap enabled
    tap_button_map lrm
}

But if you want to improve your understanding of how to configure input devices such as touchpads and mice in a Wayland compositor, then read on!

Configuring a libinput device in Sway

We need to add a libinput¹ configuration into the Sway config file, which is usually located at ~/.config/sway/config. The Sway wiki has a small example for libinput devices but it doesn't really say all that much. Generally, an input configuration should look like this.

input <selector> {
    option value
}

For the selector, we can use either the device identifier, or the type of the device. I prefer the device type, i.e. "type:touchpad", as I use the same configuration file for multiple devices that don't share touchpad identifiers².

You can run swaymsg -t get_inputs to get information about your devices. You will get a ton of output, but somewhere you'll find a device that has Type: Touchpad. Mine looks like this.

$ swaymsg -t get_inputs
...
Input device: SYNA1D31:00 06CB:CD48 Touchpad
  Type: Touchpad
  Identifier: 1739:52552:SYNA1D31:00_06CB:CD48_Touchpad
  Product ID: 52552
  Vendor ID: 1739
  Libinput Send Events: enabled
...

We can however get a whole lot more information about each device with the --raw option³. Combining that with jq⁴ we can get only touchpads out of the command. On all my devices, I only have a single touchpad device. It should look something like this.

$ swaymsg -t get_inputs --raw | jq '.[] | select(.type=="touchpad")'
{
  "identifier": "1739:52552:SYNA1D31:00_06CB:CD48_Touchpad",
  "name": "SYNA1D31:00 06CB:CD48 Touchpad",
  "vendor": 1739,
  "product": 52552,
  "type": "touchpad",
  "scroll_factor": 1.0,
  "libinput": {
    "send_events": "enabled",
    "tap": "disabled",
    "tap_button_map": "lrm",
    "tap_drag": "enabled",
    "tap_drag_lock": "disabled",
    "accel_speed": 0.0,
    "accel_profile": "adaptive",
    "natural_scroll": "disabled",
    "left_handed": "disabled",
    "click_method": "button_areas",
    "middle_emulation": "disabled",
    "scroll_method": "two_finger",
    "dwt": "enabled",
    "dwtp": "enabled"
  }
}

This gives us everything we need to configure the touchpad, and we can also see if our configuration takes effect correctly. The options shown under libinput is what we have to play with. The names aren't necessarily self-explanatory, but you can find decent descriptions of them in the sway-input(5)⁵man page under the LIBINPUT CONFIGURATION section.

I'm going to set the options that are crucial for how I use a touchpad, regardless of what their defaults are. These options are:

input "type:touchpad" {
    tap enabled         # enables click-on-tap
    tap_button_map lrm  # tap with 1 finger = left click, 2 fingers = right click, 3 fingers = middle click
    dwt enabled         # disable (touchpad) while typing
    dwtp enabled        # disable (touchpad) while track pointing
}

Putting this in my ~/.config/sway/config⁶, reloading the environment and then checking the settings shows that tap is now enabled.

$ swaymsg reload
$ swaymsg -t get_inputs | jq '.[] | select(.type=="touchpad")'
{
  "identifier": "1739:52552:SYNA1D31:00_06CB:CD48_Touchpad",
  [...]
  "libinput": {
    [...]
    "tap": "enabled",
    "tap_button_map": "lrm",
    [...]
    "dwt": "enabled",
    "dwtp": "enabled"
  }
}

And that's that, click-on-tap now works again!

Summary

In this article, I outlined how to configure a touchpad in Sway. The principles shown here carry over to any kind of libinput device, and should also be applicable in most other Wayland compositors.

libinput is a stack for common input devices, such as keyboards, touchpads and mice. Read more in the Wayland wiki. ↩
Note that the type selector will select all devices of that type. If you have multiple devices that identify as touchpads, you may want to be more specific and use the identifier. ↩
When piping swaymsg output to another command, --raw is implicit. I use it here explicitly for clarity. ↩
If you don't know how to use jq, do yourself the biggest favor of the year and learn it. ↩
Run man 5 sway-input to get the given page. See my article on man page sections if the 5 is confusing to you. ↩
To see exactly how I use this in my Sway config, refer to my config repository. ↩

Syntax highlight anything with Tree-sitter

2024-03-11T00:00:00+01:00

As of my previous post on extending NeoVim for commenting and uncommenting code blocks, I'm on something of a NeoVim extension streak. The flavor of the week is syntax highlighting. I've been using the highly customizable nvim-treesitter for the past few years. This depends on there being a Tree-sitter parser for whatever language you're working with. Usually there is, but sometimes you run into languages that are esoteric enough that there aren't any parsers available. And then you're SOL on the whole syntax highlighting part. Unless, of course, you write a parser of your own. Which naturally is what we'll do.

This is the first article in a series on working with Tree-sitter to syntax highlight anything. Although this article series is intended for a NeoVim-inclined audience, this first part has nothing to do with NeoVim and can be enjoyed by anyone interested in Tree-sitter or parsers in general. In the still under construction second part, we'll dive into working with and utilizing Tree-sitter in NeoVim.

Companion repository: The complete parser developed in this article is available in a companion repository at slarse/tree-sitter-mds

What's this Tree-sitter thing?

Tree-sitter is a parser generator tool for the modern era. What's a parser generator, you ask? Well, it's exactly what it sounds like: a tool to generate a parser! By defining the syntax of a language in a way the parser generator understands, it can generate a parser for you that can take source code of that language and transform it into a syntax tree.

Parser generators aren't a new concept. GNU Bison has been around for 38 years at this point, and YACC had been around for a decade by the time Bison arrived. So, not at all a new concept. What's novel with Tree-sitter is both how easy it is to define a grammar as long as you know a little bit of JavaScript and the fact that syntax highlighting is a built-in feature.

You can do a lot of cool things with Tree-sitter's syntax trees, but in this article our focus is on syntax highlighting.

This article assumes that you have access to a bash-like shell (such as bash or zsh). Commands intended to be executed in a shell are prefixed with $. Lines that follow a $-prefixed line but are not prefixed are output lines.

Working example: Markdown Simple

As a driving example for this article series, we'll look at a tiny subset of Markdown, which should be familiar to most anyone in the target audience. Specifically, we want to be able to highlight the following containing headings, paragraphs, inline code and code blocks.

# This is a heading
This little paragraph of text with `inline code`

```
const words = ["javascript", "code", "highlighting"];

for (const word of words) {
    console.log(word);
}
```

# This is another heading
Another paragraph with a # in the middle.

Store these sample file contents in test.mds for later use!

I call this subset Markdown Simple and choose to write it in files with the .mds file extension. At the end of this article, we'll have fully functioning syntax highlighting for .mds files, including properly highlighted JavaScript code in the multiline code blocks!

Getting started with creating Tree-sitter parsers

Tree-sitter's docs have a good Project setup section, but I found that it lacks a few key ingredients for our purposes. I will offer an augmented version here.

As a pre-requisite, you must have decently up-to-date versions node and npm installed. With that, we can get started creating a project directory.

$ mkdir tree-sitter-mds
$ cd tree-sitter-mds
$ npm init

The npm init command will prompt you for a bunch of stuff. If you don't know what to choose, just go with the defaults, it hardly matters for the rest of this article. Now, you need the tree-sitter-cli application, which you can also get with npm.

$ npm --save-dev tree-sitter-cli

Then setup a configuration file for tree-sitter-cli.

$ npx tree-sitter init-config

Note where the configuration file is written to. You may need to edit it later for your project to be located by Tree-sitter.

Baby's first grammar rule

As previously mentioned, a parser generator reads a grammar that defines some language and spits out a parser for it. In Tree-sitter, we define grammar rules in JavaScript, in a file called grammar.js in the root of your project. It should have the following structure.

module.exports = grammar({
  name: 'markdownsimple',

  rules: {
      // grammar rules go in here
  }
});

If you now try to generate a parser you should get an error about there not being any rules.

$ npx tree-sitter generate
[stdin]:409
    throw new Error("Grammar must have at least one rule.");

Let's define a very simple grammar just to be able to parse the file.

rules: {
    source_file: $ => $.text,

    text: _ => /(.|\n)+/,
}

Put the above rules into your grammar.js file and then generate and run the parser.

$ npx tree-sitter generate
$ npx tree-sitter parse test.mds

This should produce a simple syntax tree.

(source_file [0, 0] - [13, 0]
  (text [0, 0] - [13, 0]))

Let's dissect the grammar in detail so we can improve the syntax tree granularity. The root node is the source_file. Every syntax tree is rooted in this node, and there is only one of them for each parsed file. The rule definition may look a bit strange at first sight.

source_file: $ => $.text

Any non-trivial grammar is built by composing rules. Any rule that references another rule is a so-called non-terminal. The above rule says that the source_file consists of precisely one text node. So source_file is a non-terminal that references text.

We then define the text rule using regex.

_ => /(.+|\n)/,

This regex just captures any character (.) or line feeds (\n). As the rule does not reference any other rules, it is a so-called terminal.

Note: Tree-sitter supports LR(1) grammars, which limits the kinds of regex expressions that can be used. The finer details of this is beyond the scope of this article.

In summary, our grammar currently defines a source_file as a single text node. Of course, we could immediately define source_file as the regex that currently resides in text at this point; I just wanted to illustrate the concept of terminal and non-terminal rules as it will become important in the next part.

Let there be color

The goal of this article is to achieve nice syntax highlighting, so let's get started immediately. We can highlight a file using the highlight command, but it won't work out of the box.

If you run the below command, you should get an error message.

$ npx tree-sitter highlight test.mds
No language found for path `test.mds`

If a language should be associated with this file extension, please ensure the path to `test.mds` is inside one of the following directories as specified by your 'config.json':

  1. /home/slarse/github  
  2. /home/slarse/src  
  3. /home/slarse/source  
  4. /home/slarse/projects  
  5. /home/slarse/dev  
  6. /home/slarse/git

If the directory that contains the relevant grammar for `test.mds` is not listed above, please add the directory to the list of directories in your config file, located at /home/slarse/.config/tree-sitter/config.json

To resolve this we need to add some information to package.json. Put the following in there.

Tip: It's easy to break the package.json file with a misplaced comma. To ensure that you don't break the syntax of package.json, run npm i until it doesn't complain anymore.

  "tree-sitter": [
    {
      "scope": "source.markdownsimple",
      "file-types": [
        "mds"
      ],
      "highlights": [
        "queries/highlights.scm"
      ]
    }
  ],

Now run the highlight command again. If you get the same error as before, you need to add the directory that contains you project's directory to Tree-sitter's config.json file. For example, my project is located in /home/slarse/projects/tree-sitter/tree-sitter-mds, so I add the path /home/slarse/projects/tree-sitter to the list of paths at the top of /home/slarse/.config/tree-sitter/config.json.

With that fixed, you should get a new error.

$ npx tree-sitter highlight test.mds
Failed to read query file "queries/highlights.scm"

Caused by:
    No such file or directory (os error 2)

This means your project was correctly located. We now need to define queries for our highlight. Queries allow us to select nodes in the syntax tree and assign pre-defined semantic meaning to them. Put the following content in the file queries/highlights.scm in your project directory.

(text) @string

I won't go into detail on queries in this article. You can find all captures (e.g. @string) available over here, and you can read more about queries as a whole over here.

This says that any text node should be considered a string. Running the highlight command again, you ought to get some colored output.

Not all that impressive; as all nodes with any content are text nodes everything is just highlighted as a string. In my Tree-sitter configuration the string color happens happens to be a shade of green, but it may be different for you. For more granular highlighting, we need a more granular syntax tree to perform queries against.

Refining the grammar

The basic pieces of the puzzle are now in place. We have a rudimentary syntax tree allowing for an equally rudimentary syntax highlighting. It's now time to start refining the grammar to capture different components. With some crude annotations in (), we want a structure like this.

(section)
# This is a heading (heading)
This little paragraph of text with `inline code` (paragraph (text) (inlineCode (codeText)))

``` (codeBlock (codeText))
const words = ["javascript", "code", "highlighting"];

for (const word of words) {
    console.log(word);
}
```

# This is another heading (heading)
And this is another little paragraph. (paragraph (text))

Let's start off only with capturing the additional section and heading nodes with new rules.

rules: {
  source_file: $ => repeat($.section),

  section: $ => seq(
    $.heading,
    repeat($.text),
  ),

  heading: _ => /#.+/,

  text: _ => /(.|\n)+/,
}

Our non-terminals source_file and section bring in ways of composing rules with the repeat() and seq() composers. repeat(rule) means "repeat rule zero or more times, while seq(rules...) means "match these rules in this order". So a source_file node is zero or more section nodes, and a section node is a heading followed by zero or more text nodes.

Generating and running the parser again, this lands us with a slightly more granular syntax tree where we separately capture the section's heading.

(source_file [0, 0] - [13, 0]
  (section [0, 0] - [13, 0]
    (heading [0, 0] - [0, 19])
    (text [0, 19] - [13, 0])))

Clearly there's a bug here: we only have one section, yet our source file contains two. The reason for this is that text is too liberal, it captures # characters anywhere on the line. We must adjust the regex to disallow # directly after line feeds.

text: _ => /([^#]|[^\n]#)+/,

Here we say that text is either not a #, or it's a non-linefeed character followed by #. Generating and parsing again we should now have our basic section structure mapped out.

(source_file [0, 0] - [13, 0]
  (section [0, 0] - [11, 0]
    (heading [0, 0] - [0, 19])
    (text [0, 19] - [11, 0]))
  (section [11, 0] - [13, 0]
    (heading [11, 0] - [11, 25])
    (text [11, 25] - [13, 0])))

Great! We can now also adjust the query in queries/highlights.scm to only highlight headings.

(heading) @string

This should land you with the following highlighting.

Nice!

Capturing inline code and code blocks

The only thing left for us to capture is inline code and code blocks. This is however where it starts to get a little bit complicated, so let's take each of these in turn.

Capturing code blocks

Code blocks are relatively simple to capture in isolation. A rule for code blocks could look as follows.

codeBlock: $ => seq(
  '```',
  $.codeText,
  '```',
),

codeText: _ => /[^`]*/,

We could inline codeText into the codeBlock, but making it a separate rule will make it easier for us to highlight the content of the code block without affecting the backticks around it. Note that codeText is not quite the same as text; the former allows any zero or more characters except for backticks. This will of course make it impossible to use backticks inside of a codeBlock, which is a simplification we're going with here. Extending the rule to allow for backticks within a codeBlock node is left as an exercise to the reader.

We can now extend our section rule to contain codeBlocks.

section: $ => seq(
  $.heading,
  repeat(
    choice(
      $.text,
      $.codeBlock,
    )
  ),
),

If you put this in and run it, you may be a bit surprised to see that the output doesn't change from before. The syntax tree still consists of two sections containing one heading and text each. This is because text is consuming the backticks that should delimit a codeBlock. We'll address this at the same time as we define our paragraphs in the next part.

Paragraphs with text and inline code

We want to define grammar rules such that the parser can recognize paragraphs that are composed of one or more text and/or inlineCode nodes. The inlineCode rule is quite simple: a backtick, followed by zero or more characters that are anything but a backtick (i.e. codeText), followed by a backtick.

inlineCode: $ => seq(
  '`',
  $.codeText,
  '`',
),

We also need to alter the text terminal to not allow backticks, because otherwise it will just consume them like it did the # before. This will also allow our codeBlock rule to actually consume something.

text: _ => /([^#`]|[^\n]#)+/,

Now we can define the paragraph non-terminal which, again, should be one or more inlineCode or text.

paragraph: $ => repeat1(
  choice(
    $.inlineCode,
    $.text,
  ),
),

And then we update the section to repeat paragraph instead of text.

section: $ => seq(
  $.heading,
  repeat(
    choice(
      $.paragraph,
      $.codeBlock,
    )
  ),
),

We end up with the following rule set.

rules: {
  source_file: $ => repeat($.section),

  section: $ => seq(
    $.heading,
    repeat(
      choice(
        $.paragraph,
        $.codeBlock,
      )
    ),
  ),

  paragraph: $ => repeat1(
    choice(
      $.inlineCode,
      $.text,
    ),
  ),

  inlineCode: $ => seq(
    '`',
    $.codeText,
    '`',
  ),

  codeBlock: $ => seq(
    '```',
    $.codeText,
    '```',
  ),

  codeText: _ => /[^`]*/,

  heading: _ => /#.+/,

  text: _ => /([^#`]|[^\n]#)+/,
}

If you try this out, you'll find that it doesn't work, Tree-sitter won't even generate a parser for you. Instead, it complains about a conflict. Oh, dear.

Resolving a conflict in the grammar

A conflict in the grammar occurs when there are multiple ways for Tree-sitter to interpret the rules you've provided it with. In the case of our paragraph rule, Tree-sitter has the following to say.

$ npx tree-sitter generate
Unresolved conflict for symbol sequence:

  heading  paragraph_repeat1  •  '`'  …

Possible interpretations:

  1:  heading  (paragraph  paragraph_repeat1)  •  '`'  …
  2:  heading  (paragraph_repeat1  paragraph_repeat1  •  paragraph_repeat1)

Possible resolutions:

  1:  Specify a left or right associativity in `paragraph`
  2:  Add a conflict for these rules: `paragraph`

This is a little bit hard to interpret if you haven't worked a lot with parsers before. The short of it is that our use of the paragraph rule is ambiguous due to the paragraph being repeated in the section rule, and then is itself a repetition. For example, how should the first paragraph of the first section be parsed, as shown below, be parsed?

This little paragraph of text with `inline code`

Is it one paragraph containing text and inlineCode, i.e. (paragraph (text) (inlineCode))? Or is it perhaps two paragraphs, one with the text and one with the inlineCode, i.e. (paragraph (text)) and (paragraph (inlineCode))? It's impossible to tell based on the rules we've defined.

To get rid of this ambiguity, we can use associativity. As associativity is a bit of a brain twister the first time around, we won't bother trying to fully understand it here. We'll just try both and see what happens. We can make a rule left- or right-associative by wrapping it in prec.left() or prec.right(), respectively. Doing that in turn, we get the following two parse results for this part of the tree.

; left-associative result
(paragraph (text))
(paragraph (inlineCode))

; right-associative result
(paragraph (text) (inlineCode))

The right-associative version looks like what we sketched out before embarking down this road, so we'll adjust the rule accordingly.

paragraph: $ => prec.right(repeat1(
  choice(
    $.inlineCode,
    $.text,
  ),
)),

And with that, we can now create a syntax tree with a sufficient level of granularity to achieve the highlighting we set out to. If you generate the parser again and run it, you should see a tree like this.

(source_file [0, 0] - [13, 0]
  (section [0, 0] - [9, 3]
    (heading [0, 0] - [0, 19])
    (paragraph [0, 19] - [1, 48]
      (text [0, 19] - [1, 35])
      (inlineCode [1, 35] - [1, 48]
        (codeText [1, 36] - [1, 47])))
    (codeBlock [1, 48] - [9, 3]
      (codeText [3, 3] - [9, 0])))
  (section [9, 3] - [13, 0]
    (heading [9, 3] - [11, 25])
    (paragraph [11, 25] - [13, 0]
      (text [11, 25] - [13, 0]))))

We still only have a single highlight query, though, so next up is improving that.

Improving the highlight queries

We now have a significantly more granular syntax tree to work with. As a first attempt, let's color inline code including backticks with one color, code block backticks with one color and finally the code blocks themselves with another color.

(heading) @string
(inlineCode) @property
(codeBlock) @punctuation.delimiter
(codeBlock (codeText) @module)

Here we have our first non-trivial use of queries, as we have two queries referencing the same type of node: codeBlock. The first query marks the entire codeBlock with @punctuation.delimiter, which is dark grey in my color scheme. Then we mark codeText inside the codeBlock as a @module, which is yellow in my color scheme. The latter query is more specific than the former, and so takes precedence. The result is a significant improvement over our starting point.

But it's still not great. Having yellow JavaScript code is a marginal improvement over just white code. This is where Tree-sitter's killer syntax highlighting feature comes into play: injections

Injecting a JavaScript parser

Injections are super cool. They allow us to succinctly state that certain nodes should be parsed with some other parser than the one we're currently using. Just like with highlights, injections are specified with queries. What makes this even cooler is that the author of some given parser doesn't need to have thought about this; you can tack it on afterwards due to queries being completely separate from the parser itself.

First of all, we need to extend our package.json with an injections key.

  "tree-sitter": [
    {
      "scope": "source.markdownsimple",
      "file-types": [
        "mds"
      ],
      "highlights": [
        "queries/highlights.scm"
      ],
      "injections": [
        "queries/injections.scm"
      ]
    }
  ],

Then we add a file at queries/injections.scm with the following query.

(((codeBlock (codeText) @injection.content))
  (#set! injection.language "javascript"))

Unfortunately, the highlight queries will interfere a bit with this injection, so we also need to reduce our queries/highlights.scm file to the following.

(heading) @string
(inlineCode) @property

But with that change, highlighting is now a lot more interesting. Although, admittedly, the color scheme could definitely use some work.

With that, we've achieved the highlighting goals we set out to at the beginning of the article.

Summary

In this article, we explored the basics of Tree-sitter and how to apply it to provide syntax highlighting for a simple markup language. The full source code for the project can be found in the companion repository. There are still numerous shortcomings with this implementation, such as the fact that codeBlocks cannot contain backticks, or that empty lines between paragraphs don't actually produce multiple paragraphs in the syntax tree. But the point of this article wasn't to produce a perfect Markdown Simple parser, but rather illustrate some fundamental concepts of creating parsers with Tree-sitter. And that, I think, has been achieved.

In the next part of this article series, we'll explore working with Tree-sitter in NeoVim!

The final versions of the grammar and queries are inlined below for your convenience.

// grammar.js
module.exports = grammar({
  name: 'markdownsimple',

  rules: {
    source_file: $ => repeat($.section),

    section: $ => seq(
      $.heading,
      repeat(
        choice(
          $.paragraph,
          $.codeBlock,
        )
      ),
    ),

    paragraph: $ => prec.right(repeat1(
      choice(
        $.inlineCode,
        $.text,
      ),
    )),

    inlineCode: $ => seq(
      '`',
      $.codeText,
      '`',
    ),

    codeBlock: $ => seq(
      '```',
      $.codeText,
      '```',
    ),

    codeText: _ => /[^`]*/,

    heading: _ => /#.+/,

    text: _ => /([^#`]|[^\n]#)+/,
  }
});

; queries/highlights.scm
(heading) @string
(inlineCode) @property

; queries/injections.scm
(((codeBlock (codeText) @injection.content))
    (#set! injection.language "javascript"))

Extending NeoVim for commenting and uncommenting code blocks

2024-02-29T00:00:00+01:00

I've been using some variation of Vim for going on a decade now, yet I've never bothered with an efficient way of commenting out code. It just isn't something that I do very often. But I saw a colleague comment out and uncomment lines of code like a breeze in VS Code, so obviously I had to also have that capability. And then choose to not use it. Let's see how you can get that option, too.

This article contains Lua code for configuring NeoVim. To follow along, paste the Lua code presented into any .lua file and run :luafile % with the file in the current buffer. This makes the commands defined within available in your current NeoVim session.

Commenting out code the hard way

A simple no-preparation way of commenting out code in any variation of Vim is to simply select a few lines of code in visual mode and then make a substitution. Let's say we're writing Python where the inline comment character is #, then commenting out a line would look like this.

:s/^/#/

The ^ character is the "start of line" character, so here we're simply replacing the start of line with #. And don't worry, the start of line isn't actually a character; there'll be a new start of line just in front of the # after the substitution. Select a range of lines in visual mode and run the same substitution and you'll comment out the entire range.

If you do this rarely it's fine. But if you do it regularly it becomes rather cumbersome. Luckily for us, NeoVim is easy to extend.

Defining a command to comment out code

To make the above substitution a bit less of an effort, we can define a command for it. First, we need to define a function that executes the code. A very simple first effort would look like this.

function comment_out()
    vim.api.nvim_command("s:^:#:")
    vim.api.nvim_command("noh")
end

There are a couple things to note about this function.

vim.api.nvim_command synchronously executes a command.
We use : instead of / as the substitution delimiter in anticipation of the C-style // inline comment.
We clear the search highlight with noh after doing the substitution.
- Figuring out what happens if you don't do that is left as an exercise to the reader.

So that's our function for performing the substitution, and if we simply place our cursor on a line we wish to comment out we can execute it like so.

:lua comment_out()

It should insert a # character at the start of the line. But this isn't terribly ergonomic, it'd be nicer if we could just call a native NeoVim command. Fortunately, defining such a command is pretty simple.

vim.api.nvim_create_user_command("CommentOut", comment_out, {})

This exposes the :CommentOut command, and we can now invoke our commenting out function like so.

:CommentOut

Still, this feels like some amount of effort, so let's add a couple of handy keybindings to do this. I like <leader>co which is a mnemonic for comment out.

vim.keymap.set("v", "<leader>co", ":CommentOut<CR>") -- visual mode keymap
vim.keymap.set("n", "<leader>co", ":CommentOut<CR>") -- normal mode keymap

Now we can simply invoke the commenting out routing with <leader>co, or whatever else you've decided to put in there. There are however two big problems left to address:

The :CommentOut command doesn't work with a range
- If you try to select a range of lines and run the function, you will encounter an error saying E481: No range allowed
The commenting character is hard-coded as #
- That approach works great if you only work with a single language, but if you like me work daily with multiple languages that have different line comment styles it's not ideal

Let's address these in turn.

Adding support for range selection

Being able to run our little :CommentOut command only on one line at a time is hardly useful, so let's make our command compatible with range selection. First we need to modify the creation of the command, which by default does not allow ranges.

vim.api.nvim_create_user_command("CommentOut", comment_out, { range = true })

With this modification, you'll be able to execute :CommentOut when a range is selected, but only the first line of the selection will actually be commented out. We need to add a few lines of code to the comment_out() function to actually act on the entire range. While we're at it, we'll also make the function local as it doesn't need to be accessible outside the module anymore.

local function comment_out(opts)
    local start = math.min(opts.line1, opts.line2)
    local finish = math.max(opts.line1, opts.line2)
    vim.api.nvim_command(start .. "," .. finish .. "s:^:#:")
    vim.api.nvim_command("noh")
end

Options for NeoVim commands are passed in via the opts table, and the line numbers of the selection are stored in line1 and line2. Other available options are detailed in the NeoVim API docs.

Note that depending on where you start the selection, either line may be the one at the top and bottom, respectively, so we do some rudimentary math to get the start and finish of our substitution in the right order.

Selecting a few lines of code and running :CommentOut (or using the keybind for it) now actually comments out those lines! Let's now figure out how to select the correct kind of line comment for the given file type.

Choosing line comment style by filetype

While there's probably a super clever way of accomplishing this, I went with something really rudimentary: a table with line comment styles by filetype. It looks something like this:

local non_c_line_comments_by_filetype = {
    lua = "--",
    python = "#",
    sql = "--",
}
local default_line_comment = "//"

local function comment_out(opts)
    local line_comment = non_c_line_comments_by_filetype[vim.bo.filetype] or default_line_comment
    local start = math.min(opts.line1, opts.line2)
    local finish = math.max(opts.line1, opts.line2)

    vim.api.nvim_command(start .. "," .. finish .. "s:^:" .. line_comment .. ":")
    vim.api.nvim_command("noh")
end

Basically, we default to the // line comment style unless we find a match in the non_c_line_comments_by_filetype table. We use vim.bo.filetype to get the current buffer's filetype. Then we do some more string concatenation in the substitution command, and that's pretty much that. If you need other line comment styles, just add them to the table.

If you now run the :CommentOut command in a Python file, it'll use # as the comment style. But if you do it in a SQL file, it'll use --, and in a Go file it'll use //.

Uncommenting code

We now have a neat way of commenting out code, but what about uncommenting code? Let's straight up copy comment_out() and just replace the substitution expression with something that removes a line comment start.

local function uncomment(opts)
    local line_comment = non_c_line_comments_by_filetype[vim.bo.filetype] or "//"
    local start = math.min(opts.line1, opts.line2)
    local finish = math.max(opts.line1, opts.line2)

    vim.api.nvim_command(start .. "," .. finish .. "s:^" .. line_comment .. "::")
    vim.api.nvim_command("noh")
end

vim.api.nvim_create_user_command("Uncomment", uncomment, { range = true })
vim.keymap.set("v", "<leader>uc", ":Uncomment<CR>")
vim.keymap.set("n", "<leader>uc", ":Uncomment<CR>")

Note: The keybinding is a mnemonic for uncomment.

This mostly works. If you first run :CommentOut and then :Uncomment, the latter cancels out the former. But there are two notable shortcomings.

If you run :Uncomment on a line that deos not start with a line comment start you encounter an error saying Pattern not found:
The line must start with a line comment start; leading whitespace causes the substitution to fail
If you have a formatter that indents line comments, you'll be in trouble!

The first issue is simple to resolve: we wrap the call to vim.api.nvim_comand in a protected call using the pcall() function. This allows us to handle errors, but in fact all we want is to prevent any error from bubbling up to the surface; we don't really care if the substitution fails or not as a failure simply indicates there was no comment to uncomment.

The second issue requires a little bit more thought. We want to be able to uncomment lines even if there's leading whitespace to be give the command some more flexibility. With # as the line comment, a first attempt could look like this.

:s:^\s\{-\}#::

^ is the line start meta character, \s denotes any whitespace except for linebreaks, and \{-\} means "zero or more`. This mostly works, but it removes both the leading whitespace and the comment character, effectively dedenting the line. We need to capture the whitespace and put it back after removing the line comment start.

:s:^\(\s\{-\}\)#:\1:

The escapes makes the pattern a bit difficult to read, but all we've done here is to wrap the "zero or more whitespace" in a capture group (denoted by parentheses), and then we reference that capture group in the replacement part of the expression with \1.

Putting all of this together, we end up with the following uncomment() function.

local function uncomment(opts)
    local line_comment = non_c_line_comments_by_filetype[vim.bo.filetype] or "//"
    local start = math.min(opts.line1, opts.line2)
    local finish = math.max(opts.line1, opts.line2)

    pcall(vim.api.nvim_command, start .. "," .. finish .. "s:^\\(\\s\\{-\\}\\)" .. line_comment .. ":\\1:")
    vim.api.nvim_command("noh")
end

Now it should work pretty well!

Summary and full code

That's pretty much it for commenting and uncommenting code, at least as far as my semi-imagined needs for it go. This represents among the first non-trivial extensions I've made to NeoVim using only its API and made me realise just how much power I've left untapped for so many years. I will undoubtedly return with more blog posts on extending NeoVim in the future; it's just too much fun not to.

The full code can be found below. You can just copy and paste it into your root init.lua file and it should work without any further tweaks. For a more segregated placement, you can draw inspiration from my NeoVim configuration.

local non_c_line_comments_by_filetype = {
    lua = "--",
    python = "#",
    sql = "--",
}

local function comment_out(opts)
    local line_comment = non_c_line_comments_by_filetype[vim.bo.filetype] or "//"
    local start = math.min(opts.line1, opts.line2)
    local finish = math.max(opts.line1, opts.line2)

    vim.api.nvim_command(start .. "," .. finish .. "s:^:" .. line_comment .. ":")
    vim.api.nvim_command("noh")
end

local function uncomment(opts)
    local line_comment = non_c_line_comments_by_filetype[vim.bo.filetype] or "//"
    local start = math.min(opts.line1, opts.line2)
    local finish = math.max(opts.line1, opts.line2)

    pcall(vim.api.nvim_command, start .. "," .. finish .. "s:^\\(\\s\\{-\\}\\)" .. line_comment .. ":\\1:")
    vim.api.nvim_command("noh")
end

vim.api.nvim_create_user_command("CommentOut", comment_out, { range = true })
vim.keymap.set("v", "<leader>co", ":CommentOut<CR>")
vim.keymap.set("n", "<leader>co", ":CommentOut<CR>")

vim.api.nvim_create_user_command("Uncomment", uncomment, { range = true })
vim.keymap.set("v", "<leader>uc", ":Uncomment<CR>")
vim.keymap.set("n", "<leader>uc", ":Uncomment<CR>")

Happy editing!

Adding guardrails to psql for PostgreSQL

2024-02-24T00:00:00+01:00

I've been using psql for many years to interface with PostgreSQL databases. It's simple, pretty much always available as it's usually bundled with PostgreSQL and just does what it's supposed to. It does, however, have some pretty dangerous defaults. Not only are writes allowed, but it also automatically commits any statements executed outside of explicit transactions. Let's fix that.

Configuring `psql` with `.psqlrc`

The .psqlrc file can be used to set defaults for new connections made with psql. It should be placed in your home directory (i.e. ~/.psqlrc) and most often contains \set and SET commands.

The difference between \set and SET commands is a bit subtle. A SET command sets a session variable inside of the PostgreSQL server, whereas a \set command sets configuration in the psql client itself.

Making the default transaction read-only

PostgreSQL executes all statements as transactions. If you don't start an explicit transaction for a statement, PostgreSQL automatically wraps the statement in a transaction. By default transactions are both readable and writable, which isn't necessarily desirable. To make transactions read-only by default, you can set the following session variable inside of PostgreSQL.

SET default_transaction_read_only="on";

You can do this in any PostgreSQL session. It can also be configured on the database server, but here we assume it isn't. If you after having done that try to execute a statement that writes to the database, you'll get an error.

test_db=# CREATE TABLE test_table (id serial);
ERROR:  cannot execute CREATE TABLE in a read-only transaction

However, it's still possible to create an explicit transaction that's allowed to write, so it doesn't hinder you if you do need to write.

test_db=# BEGIN READ WRITE;
BEGIN
test_db=*# CREATE TABLE test_table (id serial);
CREATE TABLE
test_db=*# COMMIT;
COMMIT

To set this in .psqlrc, simply add the exact same line you'd use in an interactive session to ~/.psqlrc. When you connect to a database, any SET command in .psqlrc is automatically executed.

Pitfall: SET commands in .psqlrc are only executed when you first establish a connection to the database server. If you switch database after having connected with \c, the SET command is not executed again. Keep that in mind.

Disabling autocommit

psql automatically commits the implicit transaction created when you execute a statement outside of an explicit transaction. This can be altered such that psql implicitly creates a transaction without automatically committing it by setting the AUTOCIMMIT variable to off. This is a psql variable, so we use \set to set it.

\set AUTOCOMMIT false

To validate that it's working as expected, you should look for the * in the psql prompt.

test_db=# CREATE TABLE test_table (id serial);
CREATE TABLE
test_db=*# ROLLBACK; -- <=== * in the prompt signifies transaction
ROLLBACK

Combined with default_transaction_read_only, this automatically creates a read-only transaction that's not immediately committed, putting two layers of protection between you and an errenous write. Unlike setting default_transaction_read_only however, disabling AUTOCOMMIT persists even if you switch database with \c.

Summary

When connecting to any database that you for some reason don't want to accidentally write to, you should have guardrails. Many graphical PostgreSQL clients that I've seen my colleagues use have such guardrails by default in that an explicit commit must be issued by clicking a button. psql does not, so setting up some yourself is imperative.

With AUTOCOMMIT disabled and default_transaction_read_only set to on, you should however be about as safe as can be.

First impressions of Wayland on Arch Linux

2024-02-04T00:00:00+01:00

Wayland appears to be the future of window systems in the world of Linux, replacing the aging X.Org window system. The Arch Wiki article on X.Org refers to it as "the alternative and successor [of X.Org]" and Canonical even dropped its own Mir display server in favor of Wayland on Ubuntu. So when it became time for me to configure a new laptop in the beginning of January, I decided to do give Wayland a go. Here's what I think of it so far.

What the heck is a window system?

First of all, what the heck is a window system? In short, it's the component that renders graphical windows to your screen and communicates the user's input to the underlying operating system. The X Window System, or just X for short, is the reigning king of window systems in UNIX-like operating systems (excepting macOS which uses Quartz). You will often see it referred to as X.Org; this is simply the most commonly used open source implementation of X.

I could try to outline what X.Org's problems are and how Wayland solves many of them, but I simply don't have the level of understanding necessary to do so confidently. One thing to note is that Wayland is first and foremost a protocol for the communication between a display server and applications. A display server can implement the Wayland protocol and is then referred to as a Wayland compositor.

If you want to learn more, here's a brief article on the subjecp.

Wayland first impressions with Sway

I recently setup Wayland on a brand new work computer, and going from a clean slate setup was really effortless. I chose Sway as my compositor and started it. And it just kind of worked. Outside of configuring Sway to my liking, I didn't really have to do much of anything for it to work.

On my private Dell XPS 15 where I'm already running X.Org, setting up Wayland was similarly effortless. I just installed Sway, and even though it doesn't support NVIDIA's proprietary drivers, running Sway with the --unsuported-gpu option just worked.

All-in-all, setup was a breeze and I did not encounter any problems at all.

Upsides of Sway

I immediately noticed some improvements in my desktop experience. The first came when connecting an external display, which Sway just found and started outputting an image to. In almost a decade of using X.Org on just shy of a half dozen devices, that has never happened before without effort on my part.

I was also treated with a complete lack of screen tearing when scrolling websites and documents, which I had a lot of under X.Org. I'm sure there's a way to configure that away with X and I never bothered to look into it more than stating that it wasn't as easy as toggling a flag, but it was just for free with Wayland. Now that it's gone I would have a hard time going back to it.

Finally, most things just kind of work. It's a big accolade to note that over a whole month of use, I haven't encountered a single show-stopping problem. But I have definitely encountered quite a few lesser ones.

Downsites of Sway

Pretty much all of the downsides I've encountered have to do with the fact that a lot of applications I've used over the years were written for X.Org. This means that many applications run under the the XWayland compatibility layer, which nullifies the performance improvements of Wayland's architecture. I've especially noted that startup times for GUI apps that run under XWayland are slow compared to running X.Org directly.

Another problem is that apps directly related to X.Org typically just do not function. For example, I've been using the import command from imagemagick to grab snippeted screenshots for nigh on a decade, and it doesn't work at all with Wayland as it expects to work with an X.Org backend. Similarly, Redshift which I've been using for years to adjust color temperature does not support Wayland. It results in a whole lot of alternative for Wayland kinds of searches, but they usually bear fruit. I found grim as a replacement for import and gammastep as a replacement for redshift that way.

The incompatibility with Redshift does bring up another interesting point, namely that Wayland is still neither complete nor stable. Color management is for example still not standardized. In some respects, stepping into the land of Way feels a bit like using a beta product.

Conclusions

Will I keep using Wayland? Definitely. I've only encountered a few very minor problems with application compatibility, and most I've found alternatives for. I'm even a little bit excited to find some need completely unfulfilled in the land of Wayland applications, because that would give me a good reason to implement it myself.

Dependabot's dependency grouping is awesome

2024-01-29T00:00:00+01:00

I've been using GitHub's Dependabot since it was released around 4 years ago, and to a large extent, it's been great. Except for one thing: the sheer amount of pull requests Dependabot would open for dependency updates. For some of my repositories it became more of a chore to keep up with Dependabot's pull request spamming than to just manually update dependencies every once in a while.

Well. Turns out that's been fixed.

Dependabot's big problem: Pull request spam

Since Dependabot was released, it's only mode of operation has been to open one pull request per dependency update. As you can imagine, that leads to a whole bunch of pull requests being opened. I would routinely get dozens of dependency pull requests a week for some of my more dependency heavy repositories. Needless to say, that's just not manageable.

One of my projects, Spork, which is mostly in maintenance mod is one such example where I just gave up on keeping up with the updates. It ended up looking like this:

What a mess. But there's light at the end of the tunnel.

The fix: Grouped dependencies

To address that problem GitHub recently released Grouped Version Updates as a feature for Dependabot. In short, this means that you can get Dependabot to open pull requests containing multiple dependency updates, grouped in three different ways.

A configuration example

For my project Spork, I decided to group dependencies in three groups:

Updates to GitHub Actions dependencies
Updates to production dependencies
Updates to development dependencies

Configuring this was really straightforward, all it took was this commit.

     directory: "/"
     schedule:
       interval: "daily"
+    groups:
+      actions-deps:
+        patterns:
+          - "*"

   - package-ecosystem: "maven"
     directory: "/"
     schedule:
       interval: "daily"
+    groups:
+      dev-deps:
+        dependency-type: "development"
+      prod-deps:
+        dependency-type: "production"

In full, the configuration looks like so.

version: 2
updates:

  - package-ecosystem: "github-actions"
    directory: "/"
    schedule:
      interval: "daily"
    groups:
      actions-deps:
        patterns:
          - "*"

  - package-ecosystem: "maven"
    directory: "/"
    schedule:
      interval: "daily"
    groups:
      dev-deps:
        dependency-type: "development"
      prod-deps:
        dependency-type: "production"

For each package ecosystem, you must group on something. For github-actions, I just wanted a single group, and the way to achieve such an "everything group" is to simply use a wildcard pattern. For maven, I ended up splitting by the dependency-type attribute instead, which can be either development or production.

But that's just two ways to group, you said there were three ways? Very attentive of you! In addition to the pattern and dependency-type groupings, you can also group by update-type, which can be patch, minor or major and only really works for dependencies that comply with Semantic Versioning

Effects of the configuration

Within minutes, Dependabot had closed all* of the single dependency pull requests and created two new pull requests. One PR contained the GitHub Actions updates, which I could immediately merge.

*For reasons I don't quite get at this point, one single-dependency PR for gumtree-spoon-ast-diff remained open

Note that the actions-deps name, which is one of the keys in the YAML config file, ends up being written out in the pull request title. Another PR was opened for the grouped production dependencies, but it did not pass CI.

This showcases the one downside with grouped dependency updates: there's no indication as to which dependency caused the CI failure. If you scroll up to the first image of this article, you'll similarly see the one benefit of single dependency pull requests: the update for jgit is failing.

Note: As stated above, gumtree-spoon-ast-diff wasn't included in the grouped dependency update for some reason, so we'll ignore that it's also failing.

So I decided to try to get Dependabot to open a new pull request without jgit.

Ignoring certain dependencies in a group

I knew that jgit was the issue in this case and decided to ignore it. It seemed like ignoring minor version updates was the best path forward, so I issued an @dependabot ignore <dependency_name> minor version command.

This didn't quite have the effect I was looking for, as Dependabot then proceeded to open a new pull request with the previous minor version of jgit, which is also incompatible with Spork. At this point I decided to bring out the big guns and ignore jgit altoghether for now, as I knew I needed to put in some manual work to update it anyway.

This finally resulted in what I wanted, a new PR without a bump to jgit that does not break the build. Thus, it's ready to merge.

Of course, this comes with the caveat that I need to remember to unignore jgit when I get around to updating it. But I have 8 dependency updates with not too much effort, which is kind of cool!

Closing thoughts

I had somewhat given up on keeping dependencies up to date in my lesser loved projects. With grouped dependency updates, I feel like it will be possible for me to get back to keeping dependencies in good shape even for projects that I no longer actively work on.

The one downside is that it's no longer evident which dependency update breaks the build. However, over many years of maintaining projects built in various languages and technologies, experience tells me that it's way more common for dependencies to update just fine than for them to break something, so I think it's a tradeoff well worth taking. As patterns of which dependencies break the build more often start to become apparent, one can also refine the groupings and separate dependencies that break often from those that rarely or never do so.

To summarize, grouped dependency updates is a killer feature that makes Dependabot viable again. Everyone should use it, and it's honestly a shame that backwards compatibility demands makes it such that groups will probably never be the default.

A new dark theme for the blog!

2023-10-25T00:00:00+02:00

If you've ever visited my blog before, you'll notice it looks a bit different. I finally took the time to make a dark theme! Only 5 or so years late to the party. But late or not, I have a new theme to show off, so let's have a look!

Before and after

I only put a couple of hours into trying out different colors and contrasts, so this is still a bit of a work in progress. But I can't say I'm not pleased. Let's have a look at what it used to look like.

So, that doesn't look terrible in any way, but it doesn't fit the "programmer aesthetic". It doesn't fit my aesthetic. The new theme is much more "me", and although you should already be seeing it all around here, here's the same image as above with the new theme.

Much better! There are still a few rough edges, and I keep finding details where the old theme isn't being overridden correctly (because this is all just on top of Bootstrap), but it's all mostly there.

Parts of the puzzle

There are really only two parts to this. I'm still using the same voidy-bootstrap theme as before, but I rewrote some of the CSS with the somewhat Halloween-looking experience you're looking at right now. Which is fitting, considering Halloween is just around the corner!

The second part is the fantastic dracula theme for Pygments which gives the neat colorization of code snippets. That I find to be the single largest improvement.

Enjoy!

That's all there is to it, enjoy the new theme!

What does the number in a man page mean?

2023-10-24T00:00:00+02:00

If you open a man page on a *NIX system (such as a Linux distro), you'll always see a number next to the subject of the man page. Like GIT(1), SUDO(8) or open(n). What's that thing in parentheses? To cut a long story short, it's the section the man page belongs to. Let's discover what that means.

`man` pages are divided into sections

man pages act as a system reference manual on any *NIX system. All man pages have a heading containing the name of the page, its section and a very short description. Something like this.

<name>(<#section>)     <description>     <name>(<#section>)

As a concrete example, here's the first line you get if you execute man git.

GIT(1)                         Git Manual                        GIT(1)

The name is self-explanatory, is is the short description, but the section number is not so transparent in its meaning. To find out what it means, we can actually consult the man page for the man program itself.

$ man man
MAN(1)                     Manual pager utils                    MAN(1)
[...]
The table below shows the section numbers of the manual followed
by the types of pages they contain.

1   Executable programs or shell commands
2   System calls (functions provided by the kernel)
3   Library calls (functions within program libraries)
4   Special files (usually found in /dev)
5   File formats and conventions, e.g. /etc/passwd
6   Games
7   Miscellaneous (including macro  packages  and  conventions),
    e.g. man(7), groff(7), man-pages(7)
8   System administration commands (usually only for root)
9   Kernel routines [Non standard]
[...]

This sectioning makes it possible to have multiple man pages with the same name, but in different sections. By default, you'll get only one result when executing man <name>, and which one you get is dependent on a pre-defined search order. Which, of course, we can also find in the man page for man.

The  order  of sections to search may be overridden by the envi‐
ronment  variable  $MANSECT  or  by  the  SECTION  directive  in
/etc/man_db.conf.  By default it is as follows:

      1 1p n l 8 3 3p 0 0p 2 3type 5 4 9 6 7

In my personal experience, the default search order most often gives you what you want. But sometimes it doesn't, and then you need to figure out how to find the man page you're after.

Selecting man pages from different sections

To select a page from a particular section, you specify the section before the name. For example, as indicated by the excerpt from MAN(1) above, there's also a man(7). We can get it like so.

$ man 7 man
man(7)              Miscellaneous Information Manual             man(7)
[...]

But what if you don't actually know which section the man page you're looking for is in, you only know that the one you're looking at isn't the one? Then you can use whatis. For example, I have four different man pages named open on my machine.

$ whatis open
open (2)             - open and possibly create a file
open (3p)            - open file
open (3perl)         - perl pragma to set default PerlIO layers for input and output
open (n)             - Open a file-based or command pipeline channel

Here you can also see a couple of section numbers that look a bit different, namely 3perl and n. These don't belong to any of the standard sections, but you can open them just the same. For example, man 3perl open would open the open page from the custom 3perl section.

Note: If you don't see all available man pages when running whatis for a particular page, your man database is out-of-date, you may want to run mandb manually (or whatever is equivalent on your system) to rebuild the search index.

And that's all!

Now you should hopefully be a bit more confident in finding the man page you're looking for. And if at any point you forget what I've written here, almost all of the information is available just a man man away!

The sheer insanity of interfaces and nil in Go

2023-10-21T00:00:00+02:00

If you've only dabbled briefly in Go, you might think that its nil is analogous to the good ol' "billion dollar mistake" known as null. I thought so, too, up until just a few weeks ago when I decided to make a pass through Thorsten Ball's neat little book Writing an Interpreter in Go. That's when I first cut myself real bad on nil, or more specificaly, an interface with a nil value.

The sensible kind of `nil`

First let's look at nil behaving like we'd expect. Here's a code snippet of a "parser" that somewhat shows the situation I had.

package main

import (
    "fmt"
    "strings"
)

type IfExpression struct {
    raw string
}

func parseIf(s string) *IfExpression {
    if strings.HasPrefix(s, "if") {
        return &IfExpression{raw: s}
    }

    return nil
}

So, we have a parse function that returns a pointer to an IfExpression struct if the input is an if expression (with an incredibly loose definition of what constitutes an if expression). Let's add a main function and try it out.

func main() {
    ifExpr := parseIf("if a == b")
    fmt.Println("ifExpr", ifExpr, ifExpr == nil)

    notIfExpr := parseIf("not an expression")
    fmt.Println("notIfExpr", notIfExpr, notIfExpr == nil)
}

Running this results in completely sensible output; ifExpr is not nil while notIfExpr is.

$ go run main.go
&{if a == b} false
<nil> true

Say what you want about nil, this at least makes complete sense and is intuitively understandable. But when we throw interfaces into the mix, that goes way out the window.

The completely insane kind of `nil`

As my parser came along, it turned out I needed multiple kinds of expressions, and so had to abstract the parse function to be able to return multiple kinds of expressions. So let's generalize the expression in an interface and add another one.

type Expression interface {
    Raw() string
}

type IfExpression struct {
    raw string
}

func (ie *IfExpression) Raw() string { return ie.raw }

type ForExpression struct {
    raw string
}

func (fe *ForExpression) Raw() string { return fe.raw }

And then we adapt the parsing as well.

func parse(s string) Expression {
    if ifExpr := parseIf(s); ifExpr != nil {
        return ifExpr
    } else {
        return parseFor(s)
    }
}

func parseFor(s string) *ForExpression {
    if strings.HasPrefix(s, "for") {
        return &ForExpression{raw: s}
    }

    return nil
}

func parseIf(s string) *IfExpression {
    if strings.HasPrefix(s, "if") {
        return &IfExpression{raw: s}
    }

    return nil
}

Now, we have a more generalized parse function that tries to parse an if expression, and falls back on parsing a for expression if it turns out not to be an if. We then adapt our main function to make use of this.

func main() {
    ifExpr := parse("if a == b")
    fmt.Println("ifExpr", ifExpr, ifExpr == nil)

    forExpr := parse("for a in b")
    fmt.Println("forExpr", forExpr, forExpr == nil)
}

And at first glance, this seems to work as expected.

$ go run main.go 
ifExpr &{if a == b} false
forExpr &{for a in b} false

But what happens when you try to parse something that is neither an if nor a for? We add the following to main to find out.

    notAnExpr := parse("not an expression")
    fmt.Println("notAnExpr", notAnExpr, notAnExpr == nil)

Clearly, as "not an expression" is neither an if expression nor a for expression, both parseIf and parseFor will return nil, and as such parse should return nil. But that's not really what happens.

$ go run main.go 
ifExpr &{if a == b} false
forExpr &{for a in b} false
notAnExpr <nil> false

What? notAnExpr is <nil>, but it does not compare true to a literalnil? What does that mean?

The confusing type-and-value composition of interfaces

In Go, an interface is represented at runtime as a type and a value. You can think of it as a struct with two fields.

type Interface struct {
    type  string
    value interface{}
}

So, when parseIf(s) returns nil, the parse(s) function's return statement wraps the nil value into something like this.

Interface {
    Type: "IfExpression",
    Value: nil,
}

Armed with this knowledge, we can actually check for nil using runtime reflection.

func IsNil(value interface{}) bool {
    return reflect.ValueOf(value).IsNil()
}

func main() {
    ifExpr := parse("if a == b")
    fmt.Println("ifExpr", ifExpr, IsNil(ifExpr))

    forExpr := parse("for a in b")
    fmt.Println("forExpr", forExpr, IsNil(forExpr))

    notAnExpr := parse("a == b")
    fmt.Println("notAnExpr", notAnExpr, IsNil(notAnExpr))
}

And running this, we now seem to have a functioning way to check for nil interfaces.

ifExpr &{if a == b} false
forExpr &{for a in b} false
notAnExpr <nil> true

But do we really? Of course not.

Actually, interfaces can also be "completely" `nil`

Let's make this even more confusing, and refactor our parse function a bit. Instead of "falling back" on parseFor, we simply return nil explicitly if we can't parse any known expression.

func parse(s string) Expression {
    if ifExpr := parseIf(s); ifExpr != nil {
        return ifExpr
    } else if forExpr := parseFor(s); forExpr != nil {
        return forExpr 
    }

    return nil
}

Now let's run it again.

$ go run main.go 
ifExpr &{if a == b} false
forExpr &{for a in b} false
panic: reflect: call of reflect.Value.IsNil on zero Value

Suddenly, notAnExpr is nil, causing our check to panic. What in the world is going on here? As it turns out, explicitly returning nil from a function that returns an interface is semantically different to returning a variable contaning nil value, and typed with something implementing the interface.

return nil // actually returns nil

var value *ForExpression = nil
return nil // returns an interface with type *ForExpression and value nil

So a way around this is of course to always return nil in the form of a typed variable. But I'm not satisfied with that, because even if I'm very strict about it myself, others may not be. And I may also just make an honest mistake. So let's tweak our IsNil() function to account for true nil.

func IsNil(value interface{}) bool {
    return value == nil || reflect.ValueOf(value).IsNil()
}

If you run this, you'll see that the runtime panic is replaced with the following.

notAnExpr <nil> true

So we're done? Unfortunately, not quite.

Structs are a problem

Our IsNil() handles true nil and interfaces with nil values. But, unfortunately, it does not handle structs.

value := IfExpression{raw: "if"}
fmt.Println(IsNil(value)) // panic: reflect: call of reflect.Value.IsNil on struct Value

Poop. It appears we missed a critical part of IsNil()'s documentation. And by missed, I of course mean that we didn't read it, but here's the relevant part.

IsNil reports whether its argument v is nil. The argument must be a chan, func, interface, map, pointer, or slice value; if it is not, IsNil panics.

So, we need to check if the value is of any of these nil-able types, and otherwise assume that it is not nil.

func IsNil(value interface{}) bool {
    if value == nil {
        return true
    }

    reflected := reflect.ValueOf(value)
    switch reflected.Kind() {
        case
        reflect.Chan,
        reflect.Func,
        reflect.Interface,
        reflect.Map,
        reflect.Ptr,
        reflect.Slice:
        return reflected.IsNil()
    }
    return false
}

And with this, the panic is abated.

value := IfExpression{raw: "if"}
fmt.Println(IsNil(value)) // false

This will work so long as Go doesn't add some other nil-able type. Given how conservative the language is, such an addition to the core types seems an unlikely prospect. But of course, it could happen, and then this function would return false incorrectly in some cases.

Footguns

All languages have footguns. Some languages, like C++ and JavaScript, are seemingly built exclusively of them. Go is a small-ish and conservative language, and so naturally has fewer such contraptions. But how nil is handled differently in different contexts and how the type of a variable can change the actual value that's returned from a function, is a big one.

Of course, all of this may just be symptoms of my not knowing Go very well, and this whole article could just be the frustrated ramblings of an apprentice. But, I'm an experienced programmer, and this confused me quite a bit. That alone should be sufficient evidence that some design choices made here aren't optimal.

Book Review: Writing an Interpreter in Go

2023-10-14T00:00:00+02:00

I love programming languages, both using them and implementing them. As such, I found the concept of learning Go by creating a programming language to be just delightful. And, to put it briefly, it was. Let's talk about Thorsten Ball's book on interpreters. In Go.

Writing an Interpreter in Go
by Thorsten Ball
Publisher: Thorsten Ball
ISBN: 9783982016115

You can buy the book directly from Thorsten Ball's website at https://interpreterbook.com/.

The book in a nutshell

Writing an Interpreter in Go is precisely what it sounds like; a practical guide to writing a fully functioning interpreter in Go. You go from source code, to tokens, to abstract syntax tree and tree evaluation. The parser is based on the pretty fascinating Pratt parsing technique, while the evaluation is based on simple tree-walking. This relative simplicity allows Ball to cram a whole lot of functionality into a very concise 200 pages worth of book. The result is impressive, and I am somewhat astonished by just how much content is actually in here, and how well written said content is.

What I liked

There is so much to like about this book. First of all, it is wonderfully standalone. I wrote around 3500 lines of code based entirely on descriptions and code samples from the book's 200 printed pages. Although the source code for the interpreter is available as part of the book, I never needed to reference it.

Something else I just barely needed to reference was documentation for Go itself. While this book doesn't teach you programming and is therefore not for complete beginners, it does somewhat teach you Go by example. There are little to no explanations for how Go works, but due to how simple Go is I think the examples speak for themselves. A caveat to that is that I have done some Go programming in the past, and I think that a complete beginner to Go should probably go through the interactive A Tour of Go tutorial first. The only thing that I personally needed to reference was some details around how interfaces and nil work together in Go (it's really quite unintuitive). It should also be noted that there is nothing about concurrency in this book, which being a major selling point of Go entails that this book omits some important parts of the language. I do however find that completely reasonable given the scope of the book.

The book is also for the most part organized in a way that fit me very well. In the first few parts of the book, the implementation is given up front and then tests are added to verify the behavior. The majority of the later parts of the book are however laid out in the opposite order, with descriptions of functionality and tests being followed by the actual implementation. This allowed me to read the tests and the descriptions to get a good idea of the intended behavior, and then try my own hand at coming up with a solution. Comparing my solution to the author's after the fact was a great way to cement the knowledge.

The source code exhibits a splendid balance between proper design and being simple enough to put into a book this short. There were several design decisions that I did not fully agree with, especially with the parsing of strings, but I can also see that certain simplifications had to be made to fit the format. For the most part, I think these simplifications are well chosen.

Concepts are explained clearly and intuitively. Pratt parsing is by far the most involved topic in the book, and I think Ball presents it in a very digestible fashion. I needed to run through it a couple of times and mentally step through the code, but when it clicked I found no fault in how the concepts were explained. At no point did I need to reference external sources to understand something.

The last part of the book adds in a few extra features, such as arrays and hash maps. All of these extra features are implemented in a very satisfying loop, going all the way from the lexing to the finished evaluation in one short chapter for each feature. Adding one feature at a time in this way really aids in understanding the workings of each part of the interpreter. And that, in fact, is a perfect segue to the one part of this book that I did not entirely like.

What I didn't like

I actually started this book around five years ago, but never finished it. In fact, I got just barely halfway due to what I perceive as the one flaw of this otherwise fantastic piece of literature: it has a rather slow start. It's not until halfway through that you actually get to evaluation and thereby create a complete path from source code to output. The fantastic pacing of the last quarter of the book where you add features one at a time highlights that the first half isn't as satisfying as it probably could be. While I recognize that the amount of groundwork to put down before getting to the exciting parts is a tough one to make, I think the book would have been better off closing the path from source code to evaluation earlier. For instance, I don't see any need to parse function definitions before evaluating arithmetic expressions.

Conclusions

This is a great book. It's clear, concise and packed full of learnings. Due to the way the last quarter of the book is laid out, adding feature upon feature from start to finish, I was able to quite effortlessly add features completely of my own design by the time I finished the book. The only complaint I have about the book is that not more of it is laid out in that fashion, because it's just so good.

If you've dipped your toes in Go and want an exciting project to learn the language better, I think this is a great way to do it. It takes a little while before the book gets going for real, but when it does, it really takes off.

Fix 3D graphics in Arch Linux on Dell XPS 15 9520

2022-12-26T00:00:00+01:00

I recently got myself a Dell XPS 15 9520 to replace my aging laptop. It's highly Linux-compatible and just following the official installation guide got me 90% of the way of having a well-behaved Arch Linux laptop. One big thing wasn't working, though: 3D graphics!

Symptoms: glitchy 2D graphics and unusable 3D graphics

2D graphics worked mostly fine after installing xf86-video-intel, nvidia and nvidia-prime. Websites, most desktop applications as well as 2D games like Terraria worked without issue. Anything with the slightest hint of 3D would however glitch out terribly with Intel graphics and just be completely black with 3D graphics.

Some applications, such as Steam, wouldn't render properly unless in fullscreen. 3D benchmark software like fire (from mesa-demo) and glmark2 wouldn't update properly with Intel and would again be black with NVIDIA.

Solution: Using DRI2 for the Intel driver

The solution I found to this problem was to use DRI2 for the Intel driver. I have no idea of why this works, but it does. You can configure the Intel driver to use DRI2 by creating a configuration file for X.

# /etc/X11/xorg.conf.d/20-intel.conf
Section "Device"
  Identifier "Intel Graphics"
  Driver "intel"
  Option "DRI" "2"
EndSection

Note: The modesetting driver also works, but it interferes with PRIME offloading s.t. the NVIDIA GPU is always active. This butchers battery life and so wasn't acceptable to me.

Need more tweaks?

This wasn't the only tweak I put in to get my XPS 15 in tip-top shape, although it was the most crucial one. I've got a bunch of other tweaks listed over on my Wiki page for the XPS 15.

Book Review: CPython Internals

2022-11-20T00:00:00+01:00

About a month ago I signed myself up to do a talk at a Python meetup hosted by HiQ. I brazenly set my topic as Under the Hood of CPython, thinking I had sufficient understanding of its inner workings to produce a riveting talk. As I started preparing the talk, I came to the gut-wrenching conclusion that my knowledge was too shallow, I simply didn't know enough of the details to put together an in-depth talk on the subject. Thankfully, I knew where to turn to for the details I needed: Anthony Shaw's CPython Internals book. Here's what I think of it.

CPython Internals
by Anthony Shaw
Released May 2021
Publisher(s): Real Python (realpython.com)
ISBN: 9781775093344

Confused about the difference between Python and CPython? See The difference between Python and CPython.

The book in a nutshell

CPython Internals gives you a guided tour of the CPython project, from parsing source code to compiling bytecode to interpreting said bytecode. The book is meant to serve as a starting point for budding CPython contributors or Python developers that simply want to learn a bit more about the reference implementation. It highlights the most important files in the project for each of the respective parts and guides you through their execution. Throughout the book we also get to follow along with a worked example of extending the language with an "almost equal" operator, written as ~=.

The book concludes with three concrete ways in which you can use the knowledge you've attained: 1) creating C extensions, 2) improving existing Python programs by leveraging knowledge of the internals and 3) contributing to the CPython project. While point 2) is potentially a little bit vague, points 1) and 3) are concrete and well described.

Before writing the book, Anthony wrote an in-depth article on the same topic. You can find it over on Real Python. It is something of an appetizer for the book, but stands strong on its own. Having a brief look at that article will give you a better understanding of what the book is about than anything I could write here.

What I liked

I went from surface-level understanding of the CPython project to being pretty confident about where to poke around to do what just from reading this book. It's comprehensive in scope and the worked example of the ~= operator helps a lot in facilitating an understanding for how to extend CPython with your own silly things.

I also appreciated the nods to other respectable sources. The Python Developer's Guide is a great resource for quickly refreshing how to do something (but going from 0 knowledge about the project it's a bit to terse). Luciano Ramalho's book Fluent Python 2nd ed is also noted as an excellent reference on the Python object model, which I absolutely agree with.

There is enough context in each chapter that you don't really need much pre-existing understanding of any of the subjects. If you can read Python code and have a little bit of experience with reading C code, you're all good to go. Concepts relevant to the book such as parallelism and memory management are explained both on an abstract level and in how they are implemented in CPython. The book is to a great extent a standalone resource and it should be very approachable even to developers without much experience. There's even an appendix at the end to explain what little you need to know about the C programming language to be able to understand the code samples.

What I didn't like

There was nothing about the book that I thought was bad, but for me personally, I would have preferred less explanation of fundamental concepts (such as threading), and more in-depth details on CPython itself. That being said, I think Anthony overall has made good calls on the tradeoffs between depth and approachability. Given that the book is meant to be a starting point for CPython development as opposed to a complete reference, I think that this nit-pick of mine is nothing more than personal preference, and perhaps that I'm slightly outside the target audience of the book. It's clearly meant to contain everything you'd need to know to go from zero to hero, and it's a lot easier to skip over content you feel is redundant than it is to find content you didn't know you needed.

Conclusions

CPython Internals saved my neck. I had three weeks to go from a shallow understanding of the CPython project to being able to explain it to others in a ~30-minute talk, and I made it. Without this book I wouldn't have. It was so approachable that I could use it for "Sunday reading" before bedtime and in spare minutes on public transport.

I highly recommend this book to anyone who's interested in the CPython project. It's not necessary to have future CPython contributions as a goal to get a lot out of this book, I found it incredibly interesting in its own right. The fact that Anthony has managed to pack so much information, with so much context (recall that I thought there was too much of that) in less than 400 pages is nothing short of spectacular.

The difference between Python and CPython

2022-11-19T00:00:00+01:00

At one point or another, every Python developer or hobbyist encounters the word CPython. For example, the dis module states that it exposes an "implementation detail of the CPython interpreter". What does that mean? Some ask themselves how CPython differs from Python, and then they move on with their lives without ever getting to the bottom of it. But if you're one of the curious ones who couldn't put that thought down, this article is for you.

Python is a language specification

Python is a programming language. A programming language is an abstract concept. It's fundamentally a set of rules saying what you're allowed to write in the language, the syntax of the language, and another set of rules saying what should happen if you execute some (syntactically valid) code, the semantics of the language.

For example, take the pass statement. The syntax of the pass statement is incredibly simple, it's just the keyword pass:

pass_stmt ::=  "pass"

The semantics are equally simple.

pass is a null operation — when it is executed, nothing happens. It is useful as a placeholder when a statement is required syntactically, but no code needs to be executed, [...]

For example:

def funtion_that_does_nothing():
    pass

The pass statement could be implemented in any number of ways. It could correspond to a NOOP instruction that when executed does nothing. It could also correspond to nothing, i.e. no instruction is generated for it. There are really endless possibilities in producing the semantic behavior described by the Python language reference.

CPython is an implementation of the Python language specification

CPython is an implementation of the Python language specification. In fact, it is the reference implementation, meaning that its runtime semantics are the law. It is necessarily the case that not all runtime semantics are precisely described in the Python documentation, and then whatever CPython does is considered the desired behavior. Unless, of course, it is identified as a bug in the runtime.

When it comes to the pass statement, CPython takes the "nothing" approach of not generating an instruction. This has less overhead than generating and executing a NOOP instruction. However, as the effect of those two approaches are the same, it would be equally correct to take the NOOP instruction approach. So CPython is a reference for what happens when you execute some code, not how that happens.

There are other implementations of the Python programming language, such as PyPy. It is fundamentally different to CPython in a variety of ways. For one, PyPy is implemented in Python, whereas CPython is to a large extent implemented in C. PyPy also employs a JIT (just-in-time) compiler that can compile hot code to machine code to substantially speed up running times, often making it substantially faster than CPython.

Closing words

The difference between Python and CPython is that the former is a language specification while the latter is an implementation of said language specification. However, in most everyday conversations among developers, they are one and the same. There is also the fact that CPython being the reference implementation makes it part of the specification of the language. Python as a programming language and CPython as an implementation are to some extent mutually dependent: you can't really define one without the other.

Distinguishing at all between the two may seem nit-picky, and in the vast majority of cases I'd argue it is. But when module documentation mentions implementation details of CPython or you find mentions of other implementations altogether, the distinction becomes important for it all to make sense. For me personally, just the urge to understand why two different words were used for seemingly the same thing was enough of a justification to dig into it, but I'm also hoping that this article brings more sense into the Python worlds of others.

The pre Python 2.5 ternary operator hack

2022-11-06T00:00:00+01:00

The modern day ternary operator is well-known to most Pythonistas:

<expr_if_true> if <condition> else <expr_if_false>

It's officially known as a conditional expression and was introduced back in Python 2.5 with PEP 308. Some like it, some don't, and while discussing it with a colleague of mine he mentioned that there "used to be something a whole lot worse" around the code bases written by developers favoring ternary operators. He couldn't remember what it looked like as these events are 15 years in the past, but a few days later he came back to me with a code snippet like this:

["nope", "yep"][False]

What?

No ternary operator you say?

Developers are creative and opinionated. Sometimes this mix leads to monstrosities, such as when creative developers who really liked ternaries created this pattern:

[<expr_if_false>, <expr_if_true>][<condition>]

You're reading that right. It's a list with two elements, the first of which is returned if the condition is False and the second if it's True. Here is an example:

condition = True

message_modern = "success" if condition else "failure"
print(message_modern)   # success

message_old = ["failure", "success"][condition]
print(message_old)      # success

This will make perfect sense to a C programmer: True and 1 are interchangeable, as are False and 0. And that's precisely how this works, the underlying function that implements the list index access performs some rudimentary checks and then just uses the provided index as an offset to the base pointer of the list.

But the hack isn't quite the same

At first glance, it may seem like the old hack with the list is functionally equivalent to the modern ternary operator. It isn't quite, though, because it lacks one very important property: lazy evaluation of the branches. In short, the ternary operator only evaluates the branch that it returns. Here's an example:

condition = False

result_modern = 1 / 0 if condition else 42
print(result_modern) # 42

result_old = [42, 1 / 0][condition] # raises ZeroDivisionError: division by zero
print(result_old)

The ternary operator does not result in a crash on division by zero as it does not evaluate the expression in the true-branch, whereas the old hack crashes immediately as it first evaluates both expressions and then returns one of them. To fully emulate the modern behavior we'd need something like this:

condition = False

result_old = [lambda: 42, lambda: 1 / 0][condition]()
print(result_old) # 42

Here we get lazy evaluation by virtue of wrapping the two branches in lambdas, and then executing the lambda that's returned. I think we can all agree that's not a pretty sight.

Conditional expressions are a good thing

Regardless of your stance on using ternary operators (or conditional expressions, as they're called in Python), it's probably a good thing they exist. Otherwise creative and opinionated programmers get around to hacks to emulate the behavior that end up being completely unreadable to others.

Why I write a blog that nobody reads (and you should, too)

2022-10-29T00:00:00+02:00

I don't write this blog in hopes that it will get a lot of traction. I don't market it, I don't try to optimize it for SEO and I I don't even tell my friends and coworkers about it. Although I would of course be delighted to hear someone who found something useful in my blog, I really write it for myself. And what could I possibly get out of that? Glad you asked.

Expressing thoughts and ideas is hard

I'm a software engineer. My job is mostly about coming up with neat solutions to tricky problems. Coming up with the solution in my head is however just part of the fight as I also need to be able to concisely express the ideas to my colleagues. Even harder is to convey the heart of a technical problem or solution to a non-technical person, which happens more often than you might think. For example, a manager might come around and inquire about why something still isn't working as it should or why project X got so expensive.

Writing a technical blog about helps me practice this skill. I put quite a bit of care into most blog posts I write and I also often look back at earlier posts to consider how I can improve the writing.

Retaining knowledge is also hard

Another reason for why I write this blog is to retain knowledge. Writing about something you've just learned is a great way to further cement that knowledge. As I put a large amount of care into my posts and want to be sure that I get things right, I also often pick up bits and pieces that I missed when attaining the knowledge I'm writing about.

As I noted in the post about my new Wiki, my strive toward high quality posts does unfortunately mean that I don't publish blog posts all that often. That's where the Wiki comes in. In there I simply write things down without any particular care about quality. I can thereby help myself retain knowledge about things that I don't yet understand well enough to write a blog post about, or that I simply don't have the time to write about.

Putting yourself out there is perhaps even harder

Exposing your work to the public is not something most people are comfortable with. It's easy to feel self-conscious about the things you create and be afraid of potential backlash or negative feedback. Putting up blog posts strengthens my confidence when it comes to communicating and expressing myself, even if I don't typically get any direct feedback (be that positive or negative) due to not having comments on this site. Similarly, putting up my projects on GitHub strengthens my confidence in my technical skills.

Having published my work for many years I now feel little to no anxiety about publishing my work. Rather, I find it enjoyable to put something out in the open where others may find it, even if few ever do.

Introducing my Wiki!

2022-10-24T00:00:00+02:00

A while ago, I decided that I wanted to have a low-effort way of recording what I learn out in the open. When I publish a blog post I feel compelled to keep the standard of the writing on a fairly high level. I've started dozens of articles that I never published because I couldn't massage them into a state that meeting my own standards. And that's not even counting the hundreds of articles that I never even started because it seemed like too much of an effort to record a tiny learning experience.

This strongly goes against the Record What You Learn an Share What You Learn patterns from Apprenticeship Patterns; two patterns that I value highly. Enter the Wiki!

My public Wiki

The Wiki can be reached from the menu bar at the top of the page. Here, I jot down quick notes about what I learn, what I have yet to learn and anything else that I feel is worth recording. The quality of the writing will range from poor to worse, and it'll be haphazardly organized, at least initially. I hope to turn this into a wealth of knowledge both for myself and for others in the coming years.

Why not use an established Wiki software?

Although there are several open source Wikis out there, such as XWiki, they're usually quite heavy and require a backend. This site is a static site, which is very cheap to host and requires no particular security considerations due to the lack of a backend. I quite like those properties, and I'm currently adapting the templates for this site to better suit my Wiki ambitions. This first rendition is rough, but it suits my current needs, and improving upon it as my Wiki grows wll be an interesting challenge.

Searching the Wiki

The Wiki is indexed along with the rest of the site and can be searched using the search bar on the home page. In other words, you can search for anything on this website through that single search bar!

Going forward

I intend to keep improving the wiki. As the amount of content grows, I expect to have to improve the layout, cross-document linking and other related features. I'm very excited to finally get around to this as I've been thinking about it for a long time. Even though I do hope others will find it useful, it's mostly for my own gain, and I'm confident it will serve me well.

Book Review: The Rust Programming Language

2022-08-10T00:00:00+02:00

I've been learning Rust on and off for the past few months, and The Rust Programming Language has been my primary learning resource during this time. It's a great introduction to the language, and is even freely available online.

Here, I'm reviewing the 2018 print version. Although a little bit out of date by now, there's nothing that's become obsolete, so I can still recommend even this version. For the most up to date version, the online one is however the way to go.

The Rust Programming Language
by Steve Klabnik, Carol Nichols
Released June 2018
Publisher(s): No Starch Press
ISBN: 9781593278281

The book in a nutshell

The Rust Programming Language is really a book where the title perfectly captures what the book is about. It teaches Rust mostly by practical examples, and for the most part the examples are self-contained and executable. For some concepts, examples are either missing or more illustrative than executable, but these are few and far between.

The book also to a large extent explains programming concepts, and has a rather elaborate section on concurrency. It however isn't on a beginner level, and doesn't go out of its way to intuitively describe what a variable is, or how looping works. I would rate this a great book for someone who is already somewhat familiar with programming concepts, but it's not the best resource for getting started with programming as a whole. It's the perfect "second language book".

What I liked

This book is incredibly well written and organized. Concepts to be learned are first presented on a high level, and then the authors drill into the details. But not too far into the details; at several points we are referred to other resources to acquire a deeper understanding for certain concepts. It's also a great standalone resource for learning Rust as it covers surrounding tooling like rustup and cargo in addition to the language itself. You can learn Rust to a decent level of proficiency from this book alone.

The well thought out pacing of the book carries over to the code samples, which are excellent through-and-through. There are some rather tricky concepts to get your head around in Rust compared to other programming languages, and the code samples are crucial in getting the point across. These are also presented in a top down fashion. By that, I mean that the higher level code is presented first containing calls to yet to be defined functions that are presented later.

As a crude example, imagine that we want to present a program that adds two numbers and prints the result. That may look like so:

fn main() {
    let sum = add(1, 2);
    println!("{}", sum);
}

And then we define the add function after having presented the high-level idea we want to implement:

fn add(lhs: i32, rhs: i32) -> i32 {
    lhs + rhs
}

This top down approach to presenting code samples really helps in getting a good idea for what needs to be done before how it is actually implemented. I strongly prefer this approach to a bottom up one, where you start with the low level how before getting to the high level what.

The book ends with a project on building a multithreaded web server, which is meant to solidify many of the concepts taught throughout the book. It's a great way to close out a great book.

What I didn't like

This is my first book review where I can't come up with something that I overtly did not like about a book. There are things the book lacks, such as more beginner-friendly introductions to core programming concepts, but I feel that's by design rather than thoughtless omission. The book would simply be way too long if it had to include such things as well.

Perhaps I will find something to be annoyed with as I revisit this in the future, but as it stands I am completely pleased with my reading experience.

Conclusions

This is a great book to learn the Rust programming language. It's not appropriate for absolute beginners, but I think that may be simply a consequence of Rust being designed to tackle rather advanced problems. I would not recommend a budding programmer to start out with Rust, and so it seems completely natural to me that the official learning resource doesn't cater toward such a crowd. That being said, you don't need to be a seasoned programmer to get value out of this book, as concepts are explained with quite a lot of "backstory". You don't need to be overly familiar with the problems Rust attempts to solve (memory safety, for example) as the book clearly exemplifies the problems before outlining the solutions.

As The Rust Programming Language is freely available online, I whole-heartedly recommend it for those looking to dive into Rust. I see no good reason to go looking elsewhere for resources when there's such a great one staring you right in the face. This is the starting place for a prospective Rustacean!

Book Review: 97 Things Every Programmer Should Know

2022-07-11T21:00:48+02:00

97 Things Every Programmer Should Know is a collection of short essays by experienced programmers. And by short, I mean short: 1-3 fairly tiny pages a piece. If you're on a journey to become a software engineer then this book will give you a crash course in terminology you should be familiar with. Even as a practicing software engineer there is wisdom to be found in this book, but a novice will undoubtedly get more out of reading it.

97 Things Every Programmer Should Know
by Kevlin Henney
Released February 2010
Publisher(s): O'Reilly Media, Inc.
ISBN: 9780596809485

The book in a nutshell

As I mentioned in the introduction, 97 Things Every Programmer Should Know is a collection of very short essays by practicing professionals. The essays treat a wide variety of themes that are relevant to a practicing software engineer, ranging from hard technical skills such as the Don't Repeat Yourself principle and Single Responsibility Principle, to softer skills such as interacting with managers and fostering good relationships with your colleagues. Most of the essays are focused on the technical side of things, however, and there is good variety in the technical topics. While a lot of the essays are about how to write good code, there is also a healthy amount of recommendations for tooling to use, such as static code analysers, automated test suites, version control systems and more.

There really isn't much more to say regarding what the book is about. It's like a collection of very well-written blog posts on programming-related topics. So if you're reading this blog, chances are good you're going to enjoy this book.

What I liked

In some ways, 97 Things has a lot in common with Apprenticeship Patterns that I reviewed last week. They're both about how you improve as a software engineer. The difference is that 97 Things is a lot more to the point and more concrete. For the most part, it presents tips and tricks that you can apply immediately and see benefits from just as fast. For the budding software engineer, it's an excellent pool of topics to diversify your skill set. That's with an emphasis on topics, though. This book presents a brief introduction to a wide variety of topics, but it dives deeply into none of them. I think this is a great strength of the book, as it means you will never get stuck on some "boring" topic that doesn't interest you.

While you can absolutely draw connections between the essays, they are written as standalone pieces of work. This makes 97 Things a perfect book to read on the go, when you might just have a few minutes or so to read.

Out of all 97 things, I recall only a handful that I didn't find genuinely helpful or insightful. Some essays even contradict each other, which actually gives a nice perspective on the fact that a lot of best practices are, to a large degree, opinions. A notable example I recall is about automatic code formatting, where one essay discourages its use and a few others encourage it. I think this is a great benefit of having so many different authors. You don't just get one person's opinion.

What I didn't like

As with many other books on how to be a good software engineer, there is a slight tint of workaholism over some of the essays. I especially found an essay by Robert C. Martin on what it means to be a professional programmer to send this message. It's the same theme I found a little bit disturbing with Apprenticeship Patterns. I can't say I disagree with the message; I truly believe in the craftsmanship approach to software engineering, the path of lifelong learning. But at the same time I don't think it is for everyone, and I think it should be possible to treat programming as "just a job".

A minor inconvenience is that the essays are ordered alphabetically, where I would have preferred them to be ordered by theme. Finding a particular essay of which you recall the theme but not the title is needlessly difficult. I just now suffered through it trying to find the aforementioned essay by Robert C. Martin.

But as a package, I find little to dislike about the book. Even the workaholism part is effectively counteracted by an entire essay dedicated to sending the message "work smarter not harder".

Conclusions

This is another book that I wish I'd read years ago. I strongly recommend it as a read for any software engineer at the beginning of their career or student (self-taught or at a seat of learning) who is preparing for their career. Being such a light and quick read, I find no good reason not to spend the few hours it takes to read through the book and be exposed to a whole lot of different ideas.

Full text search with pelican-search!

2022-07-10T21:58:53+02:00

I've been meaning to take this blog into more of the Wiki direction. Or rather some unholy mixture of a Wiki and a blog. An important step in that direction is full text search, which I've now got!

This blog is powered by Pelican, and quite recently a new plugin called pelican-search popped up. It's quite simple: it adds the search bar you see on the home page, which taps into the powerful Stork search library.

It all sums up to the super fast search bar that all visitors of this website now have at their disposal. Pretty neat!

The art of learning from the less experienced

2022-07-07T22:31:50+02:00

Software engineering is a lifelong journey of learning. Regardless of how dedicated you are in your learning, there will never come a point where you have learned it all. As such, it's important to use all learning resources available to us. As is evident from the themes on my blog, I'm very partial to books and other written media. Indeed, this blog post is such written media. But perhaps the best source of learning for a software engineer is simply other software engineers.

Now, learning new things from those more experienced than you isn't that much a leap of the imagination. Of course you're going to try to soak up anything you can learn from your seasoned team lead, or that database expert who does magical things with SQL queries, or anyone else you identify as being highly proficient in something that interests you. No, there is no real challenge there, barring the fact that these experienced engineers may not have the inclination to teach you. But this blog post is about your disposition as a learner, so let's stick to the topic. And actually get to the topic to begin with: learning from the less experienced.

A student can teach their teacher

When I attended university, I worked many years as a teaching assistant in introductory computer science classes. During one class I held, there was at some point a part of an assignment that required flipping the value of a boolean variable. Something like: given a boolean variable isOdd, define a new boolean variable isEven that is true if isOdd is false, and false if isOdd is true. One student presented a solution like the one below.

boolean isEven;
if (isOdd == true) {
    isEven = false;
} else {
    isEven = true;
}

Being an enthusiastic but still rather fresh teaching assistant with not all that much programming experience, I said it was a viable solution but it would be more concise to use a ternary operator.

boolean isEven = isOdd ? false : true;

The students sat back in awe at my incredibly simple solution to the problem. That is, until 5 seconds passed and another student had a bright idea: "why not just negate isOdd?". What the student meant was the following:

boolean isEven = !isOdd;

Not only is this solution the most concise, it also more clearly represents the concept the task asked for. Something is "even" precisely if it is "not odd", after all. I managed to humble myself enough to commend the student for a well thought out solution.

First year students routinely taught me new things

I taught the first year computer science courses for four years. I expanded my skills exponentially during this time. And yet, every year there would be new first year students that knew something I did not, or had some insight I lacked. While these events definitely became less frequent as I gained more experience, they never ceased. I'm confident that I could go back there now and teach the same courses again, and there would be a student or two with something to teach me.

The great insight that I gained from this is that regardless of how far ahead you are of someone else, you are doing both yourself and them a disservice by not being open to let them teach you things. It's also incredibly hard to determine if someone is less experienced than you are. In the !isOdd scenario outlined above, I was in a position of authority relative to the student who had the best solution, but it's not unlikely that student had done a lot more programming than I had, given that I didn't start until I was in university.

The great challenge in keeping an open mind in disagreement

The reason I could so easily swallow my pride and commend the student with the !isOdd solution is not that I at that time was particularly humble. I simply agreed with the solution the student had in mind, it fit my mental model. I've since been in situations where someone I've viewed as less experienced (and more importantly, less proficient) than myself has come with a suggestion that I've fundamentally disagreed with. In such scenarios keeping an open mind is a lot more difficult, and all I can do is try to the best of my abilities. I will argue my point, and I can argue fiercely, but I also try my absolute hardest not to dismiss their point outright, and hear out their arguments. I also try not to let my predetermined view of their experience and proficiency taint my judgement. Sometimes I succeed on the spot, and sometimes I succeed in hindsight when reflecting over a past conversation. And most assuredly, sometimes I simply fail.

My point with all of this storytelling really boils down to one piece of advise: avoid leaning on your impressions of someone's experience and proficiency when evaluating their arguments for some point. I guarantee they know things you don't. Like me, you are unlikely to always succeed, but you'll benefit from the times you do. Not to mention that the other party of the argument will most often appreciate you letting them make their case. Perhaps that's actually the more important part of thes story. But it sure does not hurt that there's something in it for you as well.

Book Review: Apprenticeship Patterns

2022-07-02T13:00:48+02:00

Apprenticeship Patterns: Guidance for the Aspiring Software Craftsman is fundamentally a book about lifelong learning. It is about treating software engineering as a craft you may never master; in fact it may never have been mastered before. Perhaps it simply cannot be mastered in the traditional sense of the word. But that doesn't mean that we should not aspire to master it, that we should not embark on The Long Road. So how can a book help with that?

Apprenticeship Patterns: Guidance for the Aspiring Software Craftsman
by Dave Hoover, Adewale Oshineye
Released October 2009
Publisher(s): O'Reilly Media, Inc.
ISBN: 9780596518387

The book in a nutshell

The core tenet of the book is that software engineering is a craft. A craft in the same sense that blacksmithing is, or carpentry, or the construction of musical instruments. The authors provide a concise reasoning for why this is so.

Software engineering is a craft precisely because we don't understand it well enough to make it a codified discipline like science or engineering.

One can agree or disagree with this statement, but it is hard to argue against the notion that software engineering is something that requires a high degree of skill. That's not skill as in technical proficiency, although that is part of it. It's skill as in everything that encompasses a successful software engineer, of which technical proficiency is important but far from being the skill in it's own right.

As the title suggests, Apprenticeship Patterns is comprised of a series of patterns. These are meant to help you hone your skill as a software craftsman. Each pattern is composed of four parts:

Context: A generalized context to put the pattern in perspective.
Problem: A concrete problem statement.
Solution: One or more suggestions for solving or alleviating the problem.
Action: A concrete exercise to practice the solution.

The patterns are really contextual habits; given a situation X it is appropriate to do Y. A (perhaps the) core pattern is The Long Road, which boils down to the fact that mastering a craft is a lifelong process. It challenges the notion that the quickest way to success as measured in notoriety or material wealth is what one should aspire for. The solution is a lengthy affair, but it importantly suggests that climbing the corporate ladder through quick promotions or similar easily takes you away from the actual crafting of software, thus diverting you from The Long Road. You should be prepared to work as a developer for many years to come. Many of the other patterns of the book complement The Long Road, such as Stay In The Trenches which specifically deals with the problem of success being rewarded with promotions.

Another pattern that struck home with me is Record What You Learn. Quite unsurprisingly, it simply suggests that one should keep a record of the things you have learned and plan to learn in the future. This book review is in fact a direct application of that pattern.

There are many, many more patterns in the book. Some of my favorite ones include:

Be The Worst: Place yourself in situations where you are surrounded by craftsmen that are more skilled than you in some area, and that you can learn from. Avoid becoming complacent.
Share What You Learn: Put your learning experiences out in the open, for anyone to find.
Kindred Spirits: Surround yourself with others passionate to learn the things you wish to dive into.
Use The Source: Read open source code. A lot.
Read Constantly: Read books. A lot.

I'm not going to list all the patterns that resonated with me, and I can't even do these few ones justice with just this brief explanation of them. You really need to read the book with each context, problem, solution and action for the respective patterns to get the full picture.

What I liked

This book is a very easy read. I like to have at least one soft book to read when my mind wanders too much for me to take in a technical book, and Apprenticeship Patterns perfectly fits that bill. It took me a couple of weeks with a few pages a night to get through it. It's easy to pick it up, read about a pattern or two, and then put it down. It's well organized and easy to refer back to after completion. The patterns are presented with just enough context to make them understandable, yet the authors do not dwell on things for too long. It is a concise book that is still easy to comprehend.

I also found that it is a highly inspirational book, and approaching software engineering as a craft really speaks to me. Many of the patterns of this book are quite obvious to me and I practice several of them already, yet there are many patterns I think I should practice that I don't. Record What You Learn is the perfect example. I've been thinking for years that I should do so, but never really got around to doing it other than a sporadic blog post once or twice a year. After reading the book, I have newfound motivation to apply many of the patterns, as this book review is tangible evidence of. In the future, I intend to always finish up a book with a book review. Hopefully I will improve in writing them with time, as I am not all that happy with how this review turned out. But Share What You Learn tells me I should post this anyway, and I really do agree with that.

I also think this is a terrific book to read as a budding software engineer. It is called Apprenticeship Patterns, after all. I wish I would have read it years ago, and I wish I was already practicing many of the patterns. But better late than never, and as I will discuss in the next section there is a reason for me to be somewhat thankful for not reading it earlier.

What I didn't like

While I think this is a good handbook for mastering a craft, I also think the approach is potentially unhealthy if applied without moderation. I have personally struggled a lot with finding a balance between improving my skills as a software and just living a life separate from software engineering. Or computers and technology in general. I found the book truly inspirational, so much so that it prompted be to spend two hours of my Saturday morning writing this book review. To be completely honest, I am still tweaking the aforementioned balance. I would not say that I am struggling anymore, but I cannot deny that it is still a work in progress.

Conclusions

This book struck home with me, and I wholeheartedly recommend it. But that is a recommendation with a caveat, as I don't think this is a book for everyone. It's really all in the title, this book is for the aspiring software craftsman. I don't think you have to approach software engineering as a craft that you devote yourself to, and the authors actually allude to this as well. It's one approach, and it's a great boon to the field that some take it. But for others, working with software can be "just a job", and not their passion. That is fine, a job can be just a means to provide for yourself. And if that's you, then I say skip this book. But if you do have a passion for building great software, I think this is a book you don't want to miss.

Eleven Table Tennis: A VR masterpiece

2022-06-12T00:00:00+02:00

As a software engineer, I sit and stand still for large chunks of the day. Unfortunately, I'm also a big fan of playing video games, which traditionally falls into the same category of physical exercise. Much to avoid repetitive stress injury and everything else that comes along with sitting at a computer day in and day out, I've in the past year started to devote more time to virtual reality (VR) gaming. In some VR games you actually get to move around, and I'm writing this article to fawn over my favorite experience yet: Eleven Table Tennis.

Table tennis is an excellent fit for VR

There are two things about table tennis that makes it very well suited to VR. First, the playing field is small enough that it can fit into a fair-sized living room. Granted, fitting a full-sized table tennis table into a smaller apartment can be somewhat inconvenient, but when playing in VR you actually only need the space for half of the table. That is not to say that I don't occasionally physically assault some furniture that was unfortunate enough to come in my way, but most of the time it works well even though my playing space is only roughly 2.5x2.5 meters.

Second, the real-life feedback of hitting a table tennis ball with a racket is rather realistically reproduced by a vibration in a motion controller. There's just not a lot of feedback from those tiny and hollow table tennis balls, so even though most motion controllers can't produce particularly strong vibrations, it's still enough. I've played both with a Valve Index and an Oculus Rift CV1, and both provide a perfectly serviceable experience.

Eleven Table Tennis delivers on that premise

So the actual game of table tennis is really well suited to a VR treatment. Eleven Table Tennis takes this premise and delivers an almost realistic physics engine that for the most part just feels right. Granted, there are the odd "wait, what?..." moments from time to time, and I especially feel like some serves can be delivered in a way that's not really possible in real life. But overall, it feels true to life.

While playing against bots in Eleven Table Tennis is a perfectly fine challenge, what really sets the game apart is a stellar online multiplayer. I jump into the game both in early mornings before work and in late evenings before bed, and it rarely takes me more than 30 seconds to find an opponent. While waiting, you can keep playing against the bots as well, so there's never an idle moment.

As icing on the cake, playing table tennis in VR is a really decent workout. I'm over the moon about how I can combine my love for video games with physical exercise. There are games that do that even better, such as Beat Saber, but I find the competitive nature of table tennis to be much more engaging and worthwhile.

If you're a VR fan and don't have this game yet, do yourself a favor and get it. Just ensure that you don't have any expensive furniture in your immediate play area.

Learning a new programming language as a practicing software engineer

2022-06-12T00:00:00+02:00

When it comes to programming languages, I consider myself something of a polyglot. To me, learning a new language is one of the most enjoyable things in all of software engineering. This is especially true when you start to venture into new paradigms; going from procedural to functional, functional to logical and logical to constraint-based. Exploring programming languages is one of the best way to improve your craft, even if you don't end up using that language in your daily work.

Having learned (to varying degrees of proficiency) a good few languages over the years, I figured it's about time I share my process for learning a new one. And when I say learn here, I mean really learn the language and become proficient in it, as opposed to learning just enough to become dangerous. There is a time and place for the latter as well, but that is not the focus of this article. This article is also not about learning your first programming language, as that's a process where you need to also learn programming as a skill.

I'm currently in the process of learning the Rust programming language, and in this article I'll take you through how I go about learning it. My process, which I will detail throughout this article, is as follows:

Identify learning resources
Start learning
Build a project
Maintain knowledge

Step 1: Identify learning resources

The first thing I do is to identify the learning resources that I intend to use. This does not have to be an exhaustive list of resources, as sometimes I stumble upon more of them as I go along, but I always put in some amount of research into which learning resources are the most well received by others. I divide these into primary and secondary learning resources, where the primary ones guide me through the learning experience while the secondary ones are supplementary and can be skipped altogether if time is tight.

Primary learning resources

My primary learning resources are one or more structured overviews of the language I'm about to learn. I usually only aim for one or two primary resources, and in the vast majority of cases I go with books. The reason I prefer books over things like video courses is that I find books to be better suited to active learning. It is incredibly easy for me to watch a video and just start thinking about something else. In fact, that's something I do even when reading a book, but to a lesser extent.

Sometimes, finding a good primary learning resource is easy. In the case of Rust, the community maintains a book called The Rust Programming Language, which is the recommended starting point for newcomers. After that, I intend to continue with Programming Rust, which appears to be more in-depth. In some cases, identifying a good primary resource may be far from trivial, and requires a fair amount of looking around. But for Rust it did not take me much time at all to find what I was looking for.

I am aware that many prefer video resources over books nowadays. If that fits your learning style, then that's all good. Books are just where it's at for me personally. Sites like Udemy and Pluralsight provide an ample selection of courses suitable as primary learning resources for most programming languages in use today. In my experience, such courses do however often cater to people learning programming, rather than software engineers just looking to add a new language to their repertoire.

An important note on my primary learning resource is that I never follow more than one at any one time. That's essentially what makes it a primary resource. So right now, I'm going with The Rust Programming Language first and will then move on to Programming Rust, but I don't read them at the same time.

Secondary learning resources

Secondary learning resources are complementary, both to primary learning resources and other secondary learning resources. I most often have at one or two secondary resources that I utilize in parallel with a primary one. My favourite sources of secondary learning resources are conference talks and podcasts.

For Rust, I found a rather high-quality podcast called New Rustacean, which features the host's journey of learning the Rust programming language. It's a nice complement to my reading, and I enjoy being able to listen to it on the go. If you're about to learn Python, see my post on Awesome Python Podcasts for some inspiration.

Talks are most easily found on YouTube. Many of today's popular programming languages have at least one yearly conference, while a lot of them have many more. Python as various incarnations of PyCon, Rust has RustConf and C++ has CppCon, and you'll find that pretty much any language has a yearly conference named <Language>Con(f). Some put all talks on YouTube, while others are a bit less easy to get a hold of.

There is one more detail about secondary learning resources that I think is worth noting, namely that they keep being useful to me long after I've stopped investing time in primary learning resources. Secondary learning resources can also enter my radar when I'm already a well-rounded programmer in a given language. I will touch more on this when I discuss maintaining knowledge of a programming language.

Step 2: Start to learn

In the initial step of learning, I simply sit down with my chosen primary learning resource and consume only that. As I'm a book person, this entails sitting down for a nice read. Although a very common advice is to learn actively by trying out examples and writing your own code as soon as possible, I personally prefer to just read for a while before I start dabbling with code myself. Sometimes I'll try a small example or other that looks extra interesting, but I generally wait with any serious amount of programming until Step 3 of my learning process. The goal of Step 2 is not really to learn the language, but to get a good overview of it.

I often start using my chosen secondary resources immediately after my first sitting with a primary resource, at least if there's at least one podcast lined up. What I mostly seek from secondary resources is to get an idea of the community surrounding the programming language, what the ecosystem (e.g libraries and frameworks) looks like and keep up to date with the development of the core language. More on this in Step 4.

Step 3: Build a project

As soon as I feel like I have touched all subjects I need to sit down and start up a small project, that's exactly what I do. This point usually rolls around when I know how to create a project in the language, how to write unit tests and how to write documentation. I'll often have a particular project in mind, and then I may also know about a few additional things I need to learn about before I get going.

My chosen project for learning Rust is to create a compiler and runtime environment for a statically typed Python-like programming language that I call Rusthon. This is an open source project and you can find it over at Rusthon's GitHub page if you're interested.

Like I mentioned in Step 2, this is where the advice of active learning comes into play. It's virtually impossible to become proficient with a new language you're confined to following along with examples and solving tiny problems without context. You need something larger to work on, I really do think this is essential.

Step 4: Maintain knowledge

Once I've learned a language well enough to become proficient in it, I need to maintain that knowledge and also keep up-to-date with developments in the community and ecosystem. These are two rather separate concerns. Maintaining a working knowledge of the language requires one to write code in it. For this I keep using my project, or find new things to do. I often try to get involved in open source projects, such as RepoBee (Python) and Spoon (Java) to make meaningful contributions to the community.

Keeping up-to-date with the community and ecosystem is where I mostly keep using secondary learning resources, mostly podcasts but also talks from conferences. This may be less important in older languages with less vibrant ecosystems, but in "newer" languages like Java, Python and Rust, knowing your way around the package ecosystem and keeping up-to-date with new language features is rather important.

Final thoughts

Learning a new programming language is something I enjoy greatly. I go about it in a rather rigorous fashion where my goal is to become really proficient. I immerse myself in the language, its community and its ecosystem. This also entails that I spend some amount of effort in maintaining my knowledge and grasp on the language, leading to the selection of languages I call myself proficient in being small. When I learn a new language it often effectively replaces something already in my repertoire. Rust is a contender to replace C, although I can't say I've been all that thorough in maintaining my skills in C in the past few years.

It is also worth considering that learning a new language without maintaining your skills in it for any considerable amount of time can still be worthwhile. I think this is especially true when exploring different paradigms, as I alluded to in the introduction of this article. I learned Haskell many years back but did not maintain a working knowledge of it for very long. Yet, the concepts that I learned from being forced to code in a purely functional fashion are valuable to me to this day. It was thanks to programming in Haskell that I really grasped the concepts of recursion and higher-order functions, and understanding that has greatly benefited me in all languages I've practiced since.

To summarize, learning a new language can be greatly beneficial. Even if you don't intend to add it to your repertoire, it can still be a worthwhile effort to step out of your comfort zone and explore new programming paradigms. The things you learn from programming in one language can benefit your programming in another, and make you a more well-rounded software engineer.

RepoBee at ITiCSE and SIGCSE 2021!

2021-09-09T21:57:00+02:00

Another year, and a few new papers published on my favorite project: RepoBee. We made a hat-trick and appeared at ITiCSE 2021 for a third year in a row (see ITiCSE 2019 and ITiCSE 2020), but this time we managed to sneak in two papers. One paper that I presented detailed RepoBee's double-blind peer review, and the other paper that was presented by my colleague Tobias showcased the repobee-sanitizer plugin. Amazingly, the presentations won both "Best Presentation" awards for their category.

We also made an appearance at SIGCSE with a 20-minute demo and Q&A. The teaser trailer for the demo is available here. The demo itself was essentially that, but less compressed and a little more elaborate. It was followed by a nicely active Q&A, where we made some new connections.

All my publications can be found on the Publications page.

RepoBee at ITiCSE 2020!

2020-10-04T09:20:00+02:00

In 2019, I presented RepoBee at the ITiCSE conference in Aberdeen. This year, ITiCSE went virtual, but RepoBee still made an appearance in the new tools and tricks section with a small, two-page paper. Although I let my co-author Ric Glassey deal with the virtual presentation, I'm still quite proud that I had another paper on RepoBee published at one of the major tech education conferences.

If you're unaware, RepoBee is a tool for managing Git repositories in an educational context. A basic use case is for a teacher to have a template repository, and create copies of it for students or groups of students on GitHub or GitLab. If you've heard of GitHub Classroom, it's the same concept, but RepoBee is far more powerful and customizable, at the expense of being a little more intricate to use.

In the paper, Ric and I mostly discuss the need for customizable tools to tailor to different teaching methodologies and preferences, and how RepoBee makes this possible through plugins and by supporting both GitHub and GitLab. We also discuss the two primary modes of RepoBee, and how they can be combined. In dictate mode, RepoBee creates repositories for students based on template repositories. In discovery mode, RepoBee only sets up student teams/groups, and then the students themselves are responsible for creating the repositories, which RepoBee can then "discover".

The full paper is available in the ACM digital library.

Essential pytest pt. 3: Rerunning failed tests (and the pytest cache)

2020-10-03T19:00:00+02:00

This is the third part of a series of small articles detailing some of the functionality of the pytest testing framework that I find most essential. The series assumes you know how to run tests with pytest already.

In this third part, we'll take a super quick look at the --lf flag that lets us rerun failed tests, as well as the caching mechanism that makes it possible.

Using `--lf` to rerun failed tests

In this article, we'll use the test suite from the first article.

# test_mul.py
def mul(lhs, rhs):
    return lhs * lhs

def test_multiply_equal_numbers():
    assert mul(5, 5) == 25

def test_multiply_by_zero():
    assert mul(1, 0) == 0

def test_multiply_different_numbers():
    assert mul(5, 3) == 15

Just like in that article, the implementation of mul is broken.

$ pytest -v --tb=no
========================== test session starts ===========================
platform linux -- Python 3.8.5, pytest-6.1.0, py-1.9.0, pluggy-0.13.1
cachedir: .pytest_cache
rootdir: /home/slarse/python
collected 3 items                                                        

test_mul.py::test_multiply_equal_numbers PASSED                    [ 33%]
test_mul.py::test_multiply_by_zero FAILED                          [ 66%]
test_mul.py::test_multiply_different_numbers FAILED                [100%]

======================== short test summary info =========================
FAILED test_mul.py::test_multiply_by_zero - assert 1 == 0
FAILED test_mul.py::test_multiply_different_numbers - assert 25 == 15
====================== 2 failed, 1 passed in 0.05s =======================

Note how 2 tests failed. pytest caches the failed tests from the last run, which enables us to rerun them with the --lf|--last-failed flag. So let's do that, and show some more traceback information while we're at it. Note that only the failing tests are executed.

$ pytest -v --lf --tb=short
========================== test session starts ===========================
platform linux -- Python 3.8.5, pytest-6.1.0, py-1.9.0, pluggy-0.13.1
cachedir: .pytest_cache
rootdir: /home/slarse/python
collected 2 items                                                        
run-last-failure: rerun previous 2 failures

test_mul.py::test_multiply_by_zero FAILED                          [ 50%]
test_mul.py::test_multiply_different_numbers FAILED                [100%]

================================ FAILURES ================================
_________________________ test_multiply_by_zero __________________________
test_mul.py:8: in test_multiply_by_zero
    assert mul(1, 0) == 0
E   assert 1 == 0
E     +1
E     -0
____________________ test_multiply_different_numbers _____________________
test_mul.py:11: in test_multiply_different_numbers
    assert mul(5, 3) == 15
E   assert 25 == 15
E     +25
E     -15
======================== short test summary info =========================
FAILED test_mul.py::test_multiply_by_zero - assert 1 == 0
FAILED test_mul.py::test_multiply_different_numbers - assert 25 == 15
=========================== 2 failed in 0.12s ============================

My primary use case for --lf is for sorting out bugs. Every time a test passes, it is removed from the last-failed cache, and thus does not run the next time --lf is specified. This way, it's easy to quickly target only failing tests, and systematically eliminate them one by one.

Pitfall: A common mistake is to use --lf to eliminate the failing tests one by one, and then forget to run all tests when the last of the initially failing tests passes. It's entirely possible to fix the implementation such that a test A passes, and then subsequently reintroduce the same problem in addressing another test, but at that point A is no longer executing with --lf.

Interacting with the cache

I mentioned that the failed tests from the last run are stored in a cache. This cache is located in the .pytest_cache directory of the current working directory. There are a few flags to interact with said cache. First, you can execute pytest with the --cache-show flag to show the current contents of the cache.

pytest --cache-show
========================== test session starts ===========================
platform linux -- Python 3.8.5, pytest-6.1.0, py-1.9.0, pluggy-0.13.1
rootdir: /home/slarse/python
cachedir: /home/slarse/python/.pytest_cache
-------------------------- cache values for '*' --------------------------
cache/lastfailed contains:
  {'test_mul.py::test_multiply_by_zero': True,
   'test_mul.py::test_multiply_different_numbers': True}
cache/nodeids contains:
  ['test_mul.py::test_multiply_by_zero',
   'test_mul.py::test_multiply_different_numbers',
   'test_mul.py::test_multiply_equal_numbers']
cache/stepwise contains:
  []

========================= no tests ran in 0.00s ==========================

Here, we can for example see the contents of the last-failed cache (cache/lastfailed), and the tests currently known by pytest (cache/nodeids). It's possible to supply --cache-show with an optional value, in order to show only some part of the cache. For example, --cache-show=lastfailed shows only the last-failed cache contents.

On occasion, the cache may get into an inconsistent state, typically due to strange interactions by the user. This has happened to me on several occasions, with tests simply not executing as I expect them to. At that point, supplying the --cache-clear flag to a test run will clear the cache. Alternatively, you may simply remove the .pytest_cache directory.

Summary

Being able to execute only the failing tests from the previous test run is a very handy feature when addressing bugs, both saving time in test execution and limiting the amount of output shown to the user. It's however important to remember to execute all tests after the last failing test passes, so as to check for regressions. One should also be aware that the functionality hinges on caching in the .pytest_cache directory, which on rare occasions may need to be cleared.

Essential pytest pt. 2: Selecting tests to run

2020-10-03T14:00:00+02:00

This is the second part of a series of small articles detailing some of the functionality of the pytest testing framework that I find most essential. The series assumes you know how to run tests with pytest already.

In this second part, we'll take a look at the -k and -m options to control which tests in the test suite are executed.

The test suite

In this article, we'll use the test suite from the first article.

# test_mul.py
def mul(lhs, rhs):
    return lhs * rhs

def test_multiply_equal_numbers():
    assert mul(5, 5) == 25

def test_multiply_by_zero():
    assert mul(1, 0) == 0

def test_multiply_different_numbers():
    assert mul(5, 3) == 15

Note that mul is now correctly implemented, so all tests will pass.

$ pytest -v
========================== test session starts ===========================
platform linux -- Python 3.8.5, pytest-6.1.0, py-1.9.0, pluggy-0.13.1
cachedir: .pytest_cache
rootdir: /home/slarse/python
collected 3 items                                                        

mul.py::test_multiply_equal_numbers PASSED                         [ 33%]
mul.py::test_multiply_by_zero PASSED                               [ 66%]
mul.py::test_multiply_different_numbers PASSED                     [100%]

=========================== 3 passed in 0.01s ============================

Now, let's learn how to run subsets of these tests, without modifying the source code.

Using the `-k` option to select tests by substring matching

The -k option is wonderful, and allows us to select a subset of tests to execute based on simple substring matching. The simplest use of -k is to provide it with a whitespace-less string. Any test with a name that contains that string will be executed. To be clear, the name of a test is e.g. mul.py::test_multiply_equal_numbers, that is to say, the qualified path to it.

As a simple example, we can select only the test that multiplies by zero like so.

$ pytest -v -k zero
========================== test session starts ===========================
platform linux -- Python 3.8.5, pytest-6.1.0, py-1.9.0, pluggy-0.13.1
cachedir: .pytest_cache
rootdir: /home/slarse/python
collected 3 items / 2 deselected / 1 selected                            

test_mul.py::test_multiply_by_zero PASSED                          [100%]

==================== 1 passed, 2 deselected in 0.05s =====================

Note that 2 tests were deselected. It is also possible to create logical expressions using not, or and and. not simply inverts the condition: any test that does not match the substring is executed.

$ pytest -v -k 'not zero'
========================== test session starts ===========================
platform linux -- Python 3.8.5, pytest-6.1.0, py-1.9.0, pluggy-0.13.1
cachedir: .pytest_cache
rootdir: /home/slarse/python
collected 3 items / 1 deselected / 2 selected                            

test_mul.py::test_multiply_equal_numbers PASSED                    [ 50%]
test_mul.py::test_multiply_different_numbers PASSED                [100%]

==================== 2 passed, 1 deselected in 0.05s =====================

With or, we can select tests that match any of a number of substrings.

$ pytest -v -k 'equal or different'
========================== test session starts ===========================
platform linux -- Python 3.8.5, pytest-6.1.0, py-1.9.0, pluggy-0.13.1
/usr/bin/python
cachedir: .pytest_cache
rootdir: /home/slarse/python
collected 3 items / 1 deselected / 2 selected                            

test_mul.py::test_multiply_equal_numbers PASSED                    [ 50%]
test_mul.py::test_multiply_different_numbers PASSED                [100%]

==================== 2 passed, 1 deselected in 0.06s =====================

Finally, and allows us to select tests that match multiple substrings.

$ pytest -v -k 'multiply and equal'
========================== test session starts ===========================
platform linux -- Python 3.8.5, pytest-6.1.0, py-1.9.0, pluggy-0.13.1
/usr/bin/python
cachedir: .pytest_cache
rootdir: /home/slarse/python
collected 3 items / 2 deselected / 1 selected                            

test_mul.py::test_multiply_equal_numbers PASSED                    [100%]

==================== 1 passed, 2 deselected in 0.05s =====================

And that's pretty much all there is to the -k option. It's extremely useful when test suites grow in size, and I use it daily.

Using the `-m` option to select by marker

With -m, we can select tests by markers. You can mark a test function (or class) by placing a decorator above it.

# test_mul.py
import pytest

def mul(lhs, rhs):
    return lhs * rhs

@pytest.mark.normcase
def test_multiply_equal_numbers():
    assert mul(5, 5) == 25

@pytest.mark.edgecase
def test_multiply_by_zero():
    assert mul(1, 0) == 0

@pytest.mark.normcase
def test_multiply_different_numbers():
    assert mul(5, 3) == 15

Note that we must actually import the pytest module to be able to mark tests with @pytest.mark.x. Now, we can run all tests marked with e.g. normcase like so.

$ pytest -v -m normcase
========================== test session starts ===========================
platform linux -- Python 3.8.5, pytest-6.1.0, py-1.9.0, pluggy-0.13.1
cachedir: .pytest_cache
rootdir: /home/slarse/python
collected 3 items / 1 deselected / 2 selected                            

test_mul.py::test_multiply_equal_numbers PASSED                    [ 50%]
test_mul.py::test_multiply_different_numbers PASSED                [100%]

============================ warnings summary ============================
test_mul.py:6
  /home/slarse/python/test_mul.py:6: PytestUnknownMarkWarning: Unknown pytest.mark.normcase - is this a typo?  You can register custom marks to avoid this warning - for details, see https://docs.pytest.org/en/stable/mark.html
    @pytest.mark.normcase

[... 2 WARNINGS OMITTED ...]

-- Docs: https://docs.pytest.org/en/stable/warnings.html
============== 2 passed, 1 deselected, 3 warnings in 0.01s ===============

Note that this resulted in 3 warnings, one for each of the markings. The reason for this is that newer versions of pytest want you to register markers, as described here. The purpose of this is to avoid users misspelling markers, and registering them will make the warnings go away.

As might be expected, the -m option also accepts logical expressions using not, and and or, just like the -k option does. Personally, I very rarely use -m when using pytest, but some people swear by it, which is why I wanted to include it in this article.

Trick: Grouping related tests into classes makes selection easier

A trick that I like to employ is to group related tests into classes. The class name is then incorporated into the test's name, and it becomes very easy to select tests that are part of the same class. Here's a simple example, where I'm testing two functions in the same module test_arithmetics.py:

# test_arithmetics.py
def mul(lhs, rhs):
    return lhs * rhs

def div(lhs, rhs):
    return lhs / rhs

class TestMul:
    """Tests for the mul function."""

    def test_multiply_equal_numbers(self):
        assert mul(5, 5) == 25

    def test_multiply_by_zero(self):
        assert mul(1, 0) == 0

    def test_multiply_different_numbers(self):
        assert mul(5, 3) == 15

class TestDiv:
    """Tests for the div function."""

    def test_divide_equal_numbers(self):
        assert div(10, 10) == 1

Note that in grouping test functions into test classes, the self argument must be added. This is a little bit annoying, as I rarely if ever use the self argument in a test case, but it's something that has to be done.

Now, I can for example run only the tests in TestDiv like so.

$ pytest -v -k TestDiv
========================== test session starts ===========================
platform linux -- Python 3.8.5, pytest-6.1.0, py-1.9.0, pluggy-0.13.1
cachedir: .pytest_cache
rootdir: /home/slarse/python
collected 4 items / 3 deselected / 1 selected                            

test_arithmetics.py::TestDiv::test_divide_equal_numbers PASSED      [100%]

==================== 1 passed, 3 deselected in 0.05s =====================

Note that the test name that's printed above includes the class name, which is why it is possible to select it with -k. Of course, grouping related tests into modules is equally viable, as the module name (here, test_arithmetics.py) is also part of the test name. I typically do both by creating one test module per module of production code, and one test class per production code function. This allows me to easily select tests at two levels of granularity, which comes in very handy.

Summary

Selecting a subset of test cases to run is crucial to my development workflow. When there are 100s or even 1000s of tests to run, running all of them is often not what you want to do. My preferred way of selecting test cases is by using the -k option to match substrings of test names, but the -m option is also there for those that like to put marker decorators in their code. Finally, grouping related tests into modules and classes allows for easy selection of tests on two levels of granularity, which is something that I exploit daily.

Thoughts on graduating with an MSc in Computer Science and Engineering

2020-09-29T00:00:00+02:00

After five long years of studies (seven if you include the two years of materials science), I've finally graduated with an MSc in Computer Science and Engineering from KTH Royal Institute of Technology. I'm still awaiting my degree certificates, but the thesis is published and I don't have to do anything but wait. I have two weeks left of my one month off before I start working, and I found that now would be a good time to reflect a bit on my education.

A CS degree does not an engineer make

Early on in my education, it became abundantly clear to me that my CS degree would be highly theoretical, and the practical elements were mostly toy projects. I needed side projects, both to practice applying the theory I learned in class, and to get experience with common software development practices such as version control (Git) and issue management.

While my first few projects were toy projects, such as clanim, I started my grail project in RepoBee fairly early during my bachelor's (technically, I started its predecessor). This was a "real" project for me, that I used daily and was also used by others. This gave me great incentive to create a good product and keep working on it. As RepoBee is a management tool for version control in education, it also came naturally to adopt proper version control practices, as opposed to just winging it.

The takeaway from this is that in order to be well-equipped for work after school, side projects are really invaluable. Not only are side projects invaluable, but I think it's important to work on real projects. It creates incentive to keep going, and also gives you something to show off to future employers. If you can't come up with something yourself, then there are a borderline innumerable amount of open-source software projects out there that need all the help they can get. Such as RepoBee :)

A CS degree gives you an exceptional theoretical foundation in computing

Although I think the engineering aspects were lacking in my education, the theoretical foundation that I now possess is nothing short of incredible. I never thought I could learn so much about mathematics, algorithms, operating systems, network protocols, computer security etc in only five short years. I also learned how to learn, and how to do so efficiently. This is probably the most important thing you can take with you from university.

I think that a lot of the theory would have been very hard for me to learn on my own, whereas the practical engineering practices were not. As such, in hindsight, I appreciate the heavy emphasis on theory. I've had a large amount of use for my knowledge of algorithms, data structures and time complexities already, and given my interest in programming languages and version control systems, I expect this trend will carry on.

A lot of people will tell you that a CS degree is not worth it, that you don't even learn the practical skills you need for engineering. This is true to an extent, but I think it's an oversimplification and whether or not it's worth it is highly individual. For me, the degree was entirely worth it. I was exposed to subjects I would not have found on my own and was taught concepts I would have struggled to grasp without a tutor. I also greatly enjoy learning for the sake of it, and I like to dive deep. I like to understand how things work, rather than just understand how to use them. A CS degree is definitely not for everyone, but the blanket statement that it isn't worth it is simply false. For those that enjoy learning and have a deep interest in programming, CS is the way to go. If you want to learn the practical skills you need to land a job as fast as possible, then it probably isn't.

Teaching is learning

After finishing the first year of my studies, I applied and was accepted to a position as a teaching assistant. I would go on to work as a TA during The remaining four years of my studies, year-round. I held exercises, worked labs, corrected student submissions, developed coursework, and much more. This greatly accelerated my own learning, for two reasons. First, in order to teach a subject, you really must learn it well, and students' questions inevitably highlight the shortcomings in your own knowledge. I received so many questions that I could not answer that I likely would not have known I could not answer had those questions not been asked. Second, in working as a TA I was introduced to other more senior TAs, who were much more knowledgable than I was. Discussions with them would lead to my learning things that I would not have found out on my own.

Another benefit of working as a TA was that I got the opportunity to develop RepoBee as a paid project, giving me another source of income during the summer and winter breaks. I also got the opportunity to write some short research papers, attend conferences, and connect with other faculty. If I wanted to, I could easily launch an academic career at this point. However, even though I enjoy science, I am more interested in practical engineering, and so an academic career seems unlikely at this point.

My point here is simple: if you have the opportunity to teach, then do so! I attribute a lot of my success to my experience as a TA, and many doors have been opened for me as a result of working with other academics. I can't recommend it enough.

Closing thoughts

My education has overall been a great experience. I've met a lot of interesting people and done a lot of interesting things. I've taken classes, taught classes, written software, theses and conference papers, and despaired in the face of the odd inordinately difficult exam. Although I greatly enjoyed my time at university, I don't want to continue with a PhD. I feel done with studies. For the time being, I'll be working as a research engineer developing experimental software at KTH, which seems like a nice middle ground between academics and industry. After that, I don't really know, which is pretty exciting in and of itself.

The Linux /etc/passwd file, and why it doesn't contain passwords

2020-08-02T11:51:33+00:00

On any Linux distribution, there's a file located at /etc/passwd. This file contains information about users that exist on the system, including their username, user id, group id and more. In this short article, I'll outline the structure of the /etc/passwd file, and also illuminate why it doesn't typically contain any passwords.

Layout of the `/etc/passwd` file

The layout of the /etc/passwd file is fairly simple. Each line represents a user on the system, with different fields being separated by colons as follows:

name:password:UID:GID:GECOS:directory:shell

name and password are the username and password of the user, UID is the user's numerical id, GID is the id of the first group the user belongs to, GECOS is an optional comment, directory is the user's home directory, and shell is the path to the executable that launches the user's preferred shell. As an example, a part of my /etc/passwd file looks like this:

Note: You can find the groups users belong to in the /etc/group file.

root:x:0:0::/root:/bin/bash
slarse:x:1000:985::/home/slarse:/bin/bash
mysql:x:970:970:MariaDB:/var/lib/mysql:/sbin/nologin

We can see that the root user has the fields set as follows:

password=x
UID=0
GID=0
GECOS=
directory=/root
shell=/bin/bash

The user and group IDs of the root user are always 0, and it typically has its home directory in /root. But is the password of root user really x? No, it isn't. An x in the password field means that the password is located in the shadow file. More on that in the next section. The entry for my own user, slarse, is largely similar to that of the root user.

The entry for the mysql user is however a bit different. For starters, it has a comment in the GECOS field saying MariaDB, which indicates that the mysql user is actually used by the MariaDB fork of the MySQL database system. It also has in interesting login shell, namely /sbin/nologin. The description of the nologin program from its manpage simply reads: nologin - politely refuse a login. This program simply refuses a login, regardless of what credentials are supplied.

And that's pretty much it for what the /etc/passwd file contains. For more details, you can read the passwd (5) manpage. Now, what about that shadow file?

Hint: To access section Y of a manpage PAGE, type man PAGE.Y into a terminal. For example, to access passwd (5), you type man passwd.5.

The `/etc/shadow` file

The /etc/passwd file is a so-called world-readable, meaning that any user on the system can read it. Many programs use this file to map users to their ids, for example, and so its broad accessibility is necessary. A side effect is that storing encrypted passwords in the /etc/passwd file lets any user that has access to the system read the encrypted password of any other user. In times long past, when cracking encrypted passwords was computationally infeasible, this wasn't really a problem. Nowadays however, cracking an encrypted password is only a matter of (feasible) time.

Note: The /etc/passwd file is word-readable, but it's only writeable by root to avoid other users tampering with it, such as by replacing an x with an actual password.

The /etc/shadow file presents a solution to this problem. It is readable only by the root user, and contains the encrypted passwords of users with an x in the password field of their /etc/passwd entry. The shadow file is technically optional, but you will probably never find a system that doesn't use it.

I won't go into detail on how the shadow file is structured, as it's not a file that's typically accessed by user space programs. If you want to know more about it, you can read the manpage of shadow (5).

And that's it for this article, hope you learned something!

Essential pytest pt. 1: Controlling the verbosity of output

2020-07-31T20:07:56+00:00

This is the first part of a series of small articles detailing some of the functionality of the pytest testing framework that I find most essential. The series assumes you know how to run tests with pytest already.

In this first part, we'll take a look at the -v and --tb options to control the verbosity of the output.

The test suite

For the purposes of this article, I've implemented a very simple multiplication function called mul, along with a few tests. Here's the entire thing, in a file called test_mul.py:

# test_mul.py
def mul(lhs, rhs):
    return lhs * lhs

def test_multiply_equal_numbers():
    assert mul(5, 5) == 25

def test_multiply_by_zero():
    assert mul(1, 0) == 0

def test_multiply_different_numbers():
    assert mul(5, 3) == 15

Obviously, the implementation of mul is broken, and running pytest gives the following output.

$ pytest
========================== test session starts ===========================
platform linux -- Python 3.8.3, pytest-5.4.3, py-1.9.0, pluggy-0.13.1
rootdir: /home/slarse/python
collected 3 items                                                        

test_mul.py .FF                                                         [100%]

================================ FAILURES ================================
_________________________ test_multiply_by_zero __________________________

    def test_multiply_by_zero():
>       assert mul(1, 0) == 0
E       assert 1 == 0
E        +  where 1 = mul(1, 0)

test_mul.py:8: AssertionError
____________________ test_multiply_different_numbers _____________________

    def test_multiply_different_numbers():
>       assert mul(5, 3) == 15
E       assert 25 == 15
E        +  where 25 = mul(5, 3)

test_mul.py:11: AssertionError
======================== short test summary info =========================
FAILED test_mul.py::test_multiply_by_zero - assert 1 == 0
FAILED test_mul.py::test_multiply_different_numbers - assert 25 == 15
====================== 2 failed, 1 passed in 0.08s =======================

Let's learn how to control how much of what we see here.

Using the `--tb` option to control traceback verbosity

Most of what you're seeing in the output of the previous section is the traceback information. While the traceback shown above is manageable as is, consider that it stems from a single-line function and single-line tests. With that in mind, it's actually pretty freaking verbose. We can show less of it by using the --tb option. We can even shut it off completely.

$ pytest --tb=no
========================== test session starts ===========================
platform linux -- Python 3.8.3, pytest-5.4.3, py-1.9.0, pluggy-0.13.1
rootdir: /home/slarse/python
collected 3 items                                                        

test_mul.py .FF                                                         [100%]

======================== short test summary info =========================
FAILED test_mul.py::test_multiply_by_zero - assert 1 == 0
FAILED test_mul.py::test_multiply_different_numbers - assert 25 == 15
====================== 2 failed, 1 passed in 0.02s =======================

This is useful when you're just trying to figure out what tests are failing, and when test output is just entirely overwhelming. I find myself using it quite frequently. Another useful traceback value is line.

$ pytest --tb=line
========================== test session starts ===========================
platform linux -- Python 3.8.3, pytest-5.4.3, py-1.9.0, pluggy-0.13.1
rootdir: /home/slarse/python
collected 3 items                                                        

test_mul.py .FF                                                         [100%]

================================ FAILURES ================================
/home/slarse/python/test_mul.py:8: assert 1 == 0
/home/slarse/python/test_mul.py:11: assert 25 == 15
======================== short test summary info =========================
FAILED test_mul.py::test_multiply_by_zero - assert 1 == 0
FAILED test_mul.py::test_multiply_different_numbers - assert 25 == 15
====================== 2 failed, 1 passed in 0.03s =======================

This lets us see the exact lines where the test failures occurred. In this case, it shows the lines of the assertions, but it could also for example show the line where an exception was raised.

Another one that I find really useful is --tb=short. It shows the full traceback, but with much less context around each function call. It won't make much of a difference for this short a traceback, but it makes a world of difference for deeply nested function calls.

There are more ways to manipulate the traceback, but these are the two I use the most, aside from the default. To see the other options, refer to pytest -h and look for the --tb option.

Using `-v` to show more verbose test output

The -v option controls the verbosity of test output while the tests are running, and also the verbosity of single items in the traceback. It's really useful when tests take a long time to run, and you want to know approximately where you're at.

$ pytest --tb=no -v
========================== test session starts ===========================
platform linux -- Python 3.8.3, pytest-5.4.3, py-1.9.0, pluggy-0.13.1 -- /usr/bin/python
cachedir: .pytest_cache
rootdir: /home/slarse/python
collected 3 items                                                        

test_mul.py::test_multiply_equal_numbers PASSED                         [ 33%]
test_mul.py::test_multiply_by_zero FAILED                               [ 66%]
test_mul.py::test_multiply_different_numbers FAILED                     [100%]

======================== short test summary info =========================
FAILED test_mul.py::test_multiply_by_zero - assert 1 == 0
FAILED test_mul.py::test_multiply_different_numbers - assert 25 == 15
====================== 2 failed, 1 passed in 0.03s =======================

Note how each test is now shown on a line of its own, as opposed to just . and F in the previous runs. The lines show up as the tests are running, and I find it useful to track long-running tests.

But what about that "single-item" verbosity that I mentioned? When there are single items in the traceback that are very large, such as a list of say 1000 elements, then pytest will truncate them by default. To demnstrate, consider this single (pointless) test:

# test_truncation.py
import pytest

def test_truncation_demonstration():
    assert [0, 1, 2, 3] == list(range(1000))

Running this test will yield a traceback that looks something like this.

$ pytest
[... OMITTED ...]
______________________ test_truncation_demonstration _____________________

    def test_truncation_demonstration():
    >       assert [0, 1, 2, 3] == list(range(1000))
    E       assert [0, 1, 2, 3] == [0, 1, 2, 3, 4, 5, ...]
    E         Right contains 996 more items, first extra item: 4
    E         Use -v to get the full diff

    truncation.py:4: AssertionError

Note how the long list has been truncated such that only the first few items are shown. Note also how pytest is suggesting the use of -v. If we supply -v, it actually still doesn't show the whole list.

$ pytest -v
[... OMITTED ...]
______________________ test_truncation_demonstration _____________________

    def test_truncation_demonstration():
    >       assert [0, 1, 2, 3] == list(range(1000))
    E       AssertionError: assert [0, 1, 2, 3] == [0, 1, 2, 3, 4, 5, ...]
    E         Right contains 996 more items, first extra item: 4
    E         Full diff:
    E           [
    E            0,
    E            1,
    E            2,
    E            3,...
    E         
    E         ...Full output truncated (998 lines hidden), use '-vv' to show

    truncation.py:4: AssertionError

In fact, pytest isn't even showing more output here, it just shows the same things more verbosely. If we stack -v twice, i.e. -vv or -v -v, then we get the full output. I find -v on its own to be rather useless, and typically always supply -vv if I want verbose output. For obvious reasons, I will not show what that output looks like.

Summary

In this article, we had a look at the --tb and -v options to control output verbosity in pytest. --tb controls the overall size of the traceback, and can be supplied with values like no for no output at all, line for just a single line of traceback, or short for complete traceback with shortened context. -v controls the verbosity of running tests and single items in the traceback, such as very long lists which are truncated by default. Typically, -vv is required to get fully verbose output.

And that's about it, I hope you learned something!

Don't use String for method options, use an enum!

2019-11-10T18:56:00+01:00

In this article, we are going to have a look at a method that accepts an option. That is to say, it accepts an argument that somehow decides how it operates. If you use a lot of libraries in your day-to-day programming, you're bound to come across methods that accept String values as such options, and you've probably been infuriated by you misspelling the options, or just trying to figure out what options there are in the first place. That suggests that there must be a better solution, and as you may have figured out by now, enums is that solution.

Note: This article discusses enums in Java, but the very same arguments are valid for any language that has support for enum types, or something comparable.

The problem: What options do I have?

Consider the following method that formats a String according to an option supplied as another String:

Yes, this is a somewhat contrived example, but bear with me!

public static String format(String str, String option) {
    switch (option) {
        case "upper":
            return str.toUpperCase();
        case "lower":
            return str.toLowerCase();
        default:
            throw new IllegalStateException("Internal errror, unmatched option " + option);
    }
}

We could then use the method something like this:

format("hello", "upper");

Can you see any problems with this? Well first of all, if you don't have access to the source, how do you know which options can be passed? There are an infinite amount of strings, after all. At best, the Javadoc will say precisely which values are valid options, but that is not always the case even in the Java standard library. But even if all of the options are clearly documented at one point, it would be so easy for a developer to add or remove an option, and forget to enact the corresponding change in the Javadoc. It is also difficult to have automatic checks that actually verify that all possible options are documented. And even assuming that all options are properly documented at all times, the compiler can't distinguish which String values are valid and which are not, so a user misspelling an option won't know until runtime.

The solution: enums!

My goal here is not to explain the ins and outs of what enums are, but rather show a use case. In short, an enum is a data type with a (typically very) limited amount of possible values (you can read more about it here). Now, let's instead define this enum type:

public enum FormatOption {
    UPPER,
    LOWER;
}

And refactor the method with it:

public static String format(String str, FormatOption option) {
    switch (option) {
        case UPPER:
            return str.toUpperCase();
        case LOWER:
            return str.toLowerCase();
        default:
            throw new IllegalStateException("Internal errror, unmatched option " + option);
    }
}

This method can then be used like:

format("hello", FormatOption.UPPER);

With this small alteration, we have eliminated all of the problems mentioned before. The possible values for the option argument are now self-documented in the FormatOption enum. Additionally, any modern IDE will kindly list the possible values when you type FormatOption., such that a programmer does not even necessarily need to consult the documentation, assuming that the enum values have descriptive enough names. The compiler can also distinguish between FormatOption.UPPER and a misspelling such as FormatOption.UPER, as the latter is not defined, so runtime errors due to invalid options is no longer a problem.

What's the catch?

So what's the catch? Well, if you have many methods like this, you'll end up with a lot of enum types. Personally, I think that's totally worth it, and you could also nest the enums inside the classes that use them to reduce their overall footprint in the project. The whole thing could then look like this:

public class Formatter {
    public static enum Option {
        UPPER,
        LOWER;
    }

    public static String format(String str, Option option) {
        String result;
        switch (option) {
            case UPPER:
                return str.toUpperCase();
            case LOWER:
                return str.toLowerCase();
            default:
                throw new IllegalStateException("Internal errror, unmatched option " + option);
        }
    }
}

Didn't add that much complexity now, did it?

Summary

If you have a method that you want to pass options to, use enums. That's really all there is to it. Enums are quite widely used in the Java standard library as well, such as StandardCopyOption, StandardOpenOption and LinkOption in the java.nio.file API, which are used much in the same way as I used the enum in this article. Hopefully, having read this article, you won't be creating any more APIs that accept String options!

Java's Optional: Why you should prefer it over null

2019-10-11T16:57:00+02:00

Null references are problematic, to say the least. Tony Hoare (inventor of the null reference) even went as far to say call them his "billion dollar mistake". In this article, I first make a cursory exploration of why null references are so problematic, and then have a look at Java's proposed solution: the Optional<T> class.

Why null is problematic

There are many reasons why null is problematic, but there are a few that are particularly easy to illustrate. I will be using the Map.get method as an example, as it returns null if the key provided to it is not in the map. For all of the examples, assume that there is a variable Map<Integer, String> map in the current scope.

null circumvents the type system

Java has a fairly rigorous type system (although it's not entirely sound, I may do an article on that later!). The type system can't help with null, however. Any variable of reference type can either contain a reference to an object of that type, or null. This leads to a whole lot of code that looks like this:

String value = map.get(10);
String valueUpper;
if (value != null) {
    valueUpper = value.toUpperCase();
}
// else do something different with the knowledge that value is null

While you may argue that null checks are pretty ugly, the real problem is that they are not enforced by the type system. The above might just as well have been written like this, and the type checker would have been none the wiser:

String value = map.get(10);
String valueUpper = value.toUpperCase();

This program could crash with a NullPointerException. That's not great. But it can be even worse. What if the call to value.toUpperCase() occurs in an entirely different part of the program, perhaps hours or even days after Map.get returned null? Then, not only do you have a crash, but a crash that can potentially be very difficult to diagnose.

It's easy to miss that a method can return null

The cause of the previous problem is often that it's not always obvious that a method may return null. If you have a look at the documentation for Map.get, you'll see that it says that it may return null a little here and there, and it's pretty clear about that. But still, a careless developer might miss this, and there's also the fact that many methods are not this well documented.

Why not just throw an exception instead?

One question you may ask is, why even return null, why not just throw an exception? Indeed, throwing an exception may be a good solution in many cases, but sometimes it just isn't desirable. In the case of Map.get, it's mostly about efficiency. If Map.get were to throw an exception when the key was missing, you'd essentially have two alternatives.

Ask for forgiveness (catch the exception)

String value;
try {
    value = map.get(10);
} catch (NoSuchElementException e) {
    // handle error
}

That's both ugly, and very inefficient if it is often the case that the key is missing. Catching an exception involves a whole lot of work for the JVM, so you really do not want to do this for an operation that you often perform.

Look before you leap

String value;
if (map.containsKey(10)) {
    value = map.get(10);
}
// else handle missing keys

This is not too ugly, but it is inefficient as the map has to be traversed twice: once to check if it contains the key, and once more to fetch the value associated with the key. Throwing an exception becomes even more undesirable if you're using Streams. So, I think we can safely conclude that throwing an exception is not the be-all-end-all solution.

`Optional<T>` to the rescue

Using the Optional<T> class, we solve the issues discussed previously. Optional is simply a container for another object that may or may not be null. Optional.get returns the contained object if present, or throws a NoSuchElementException if it is not (i.e. it is null). Optional.isPresent lets us check first if the value is present, to avoid an exception if it is not. Let's pretend like there's a method Map.getOptional that returns an Optional<T> instead of just T. We then have several options.

Important: There is no Map.getOptional method, this is just hypothetical. We'll see toward the end of the article how one can wrap Map.get to create a getOptional method.

Retrieve the value without checking if it is present

If you're certain that the value will be present, you may simply retrieve it immediately.

Optional<String> opt = map.getOptional(10);
String value = opt.get(); // `value` will either be non-null, or we crash

This may at first glance just look like more work than previously, two get calls instead of one. The benefit here is that the programmer is extremely unlikely to miss the fact that the returned value may not be present, as the return value itself has the type Optional<String>, and the type system will scream bloody murder if they try to use the Optional<String> value like a String. Requiring that extra get forces the programmer to make a conscious choice of how to handle errors. Now, this may crash with a NoSuchElementException, but as Optional values typically are not passed around, the is unlikely to happen far from where the Optional was produced.

Check if the object is present

If you're uncertain whether the value will be present, you may simply check for it:

Optional<String> opt = map.getOptional(10);
String value;
if (opt.isPresent()) {
    value = opt.get();
}

This is very much like the null check we had before, but it's supported by the type system.

Use a fallback value

Often when we don't want the code to crash, we have a fallback value. Optional has a method to handle that.

String value = map.getOptional(10).orElse("Nothing here :)");

If the value is present, it is returned. Otherwise, we get "Nothing here :)". This, I think, is one of the cleanest and tidiest uses of Optional.

What about drawbacks?

Of course, Optional has drawbacks. As every single object is wrapped in another object, there will be an increase in memory consumption. The extra method call may result in a noticeable performance penalty, but I don't dare say anything concrete about that without running some tests (the JVM is pretty darn good at inlining and optimizing). Perhaps, your most performance critical segments of code should not use Optional. But the vast majority of your code is not performance critical, so most often it will be a moot point. Finally, there's also a little bit of extra boilerplate to deal with, which also may not be desirable.

Alright, I'm sold, how do I `Optional`?

We've already had a look at how to consume Optionals. But how do we produce them? It's really quite easy. The three most important methods are:

static <T> Optional.of(T value)
- A static method that wraps a value in an Optional instance. Throws an exception if value is null.
static <T> Optional.empty()
- A static method that returns an empty Optional instance.
static <T> Optional.ofNullable(T value)
- A static method that wraps a value in an Optional. The value may be null, which essentially produces an empty Optional.

The of and empty methods are the ones you want to produce Optional values from scratch. Here's a useless but simple example: an identity function for integer values that is only defined for n >= 0. Using null, it would look like this.

/**
 * @param n An Integer value.
 * @return n iff n >= 0, otherwise null
 */
public Integer id(Integer n) {
    if (n >= 0) {
        return n;
    }
    return null;
}

This comes with all of the previously discussed problems associated with null return values. Here's the equivalent method using Optional.

/**
 * @param n An Integer value.
 * @return An Optional with n iff n >= 0, otherwise an empty Optional.
 */
public Optional<Integer> id(Integer n) {
    if (n >= 0) {
        return Optional.of(n);
    }
    return Optional.empty();
}

Notice how both the documentation, and the return value itself, clearly states that the value returned from the method may not be present. It is more or less impossible to miss that this method may return an empty value (as long as you know what Optional is, that is).

The ofNullable method is great for wrapping existing methods that may return null. For example, assuming that we have the Map<Integer, String> map field from earlier, we can wrap its get method in our own getOptional.

public getOptional(Integer key) {
    String value = map.get(key); // might be null!
    return Optional.ofNullable(value);
}

This lets us easily create APIs that contain two versions of, for example, a getter method: one that returns T, and one that returns Optional<T>. And that's all the essentials. Not that hard, right?

Summary

Optional solves most of the problems with null references in a mostly elegant way. The most important thing with Optional is that it is a strong form of documentation, which states both to the programmer and to the compiler that the value asked for may be present. There's also the fact that Optional provides both the null-check approach using isPresent, and the exception-throwing approach by calling get without checking for presence. As such, the caller of a method gets to decide which of these approaches to use, increasing flexibility. Optional is also a critical part of the Stream API, which would be forced to throw exceptions left and right without it (and you'd be forced to catch them!). Although the use of Optional may incur a performance penalty, it is trivial to provide two versions of performance critical methods: one that returns an Optional<T> and one that just returns T. If you want to learn more about Optional, I recommend first checking out the API documentation. I also encourage having a look at the source code for Optional, it's a surprisingly simple class that provides all of this functionality!

RepoBee (and Simon) at ITiCSE 2019!

2019-07-22T23:02:00+02:00

I just got home from the Innovation and Technology in Computer Science (ITiCSE) conference in lovely Aberdeen, Scotland. I was there to present a small experience paper on developing and using RepoBee, an open source tool for generating and managing large amounts of Git repositories for students in higher education. The paper is available over in the ACM Digital Library, and it's my very first publication outside of school, so I'm quite proud of it!

The conference itself was also terrific. The atmosphere was very friendly, there were many interesting talks and social activities, and I have a hard time imagining how a conference could be more inviting to a first-time conference-goer like me. If you have a chance to attend it, I highly recommend doing so.

Git worktrees: work in parallel on multiple versions of a project

2019-07-22T22:01:00+02:00

I've been AWOL for a month due to injury, sickness and conference-going. But with all that finally out of the way, I have another Tip of the Week, this time relating to Git: the git worktree command. With git worktree, you can check out multiple branches at once, which is super useful for when working on major changes where you need to view multiple versions, or maybe you're just trying a few different solutions to a single prodlem. If you've ever found yourself frantically switching branches, stashing changes to be able to switch branches, and even creating copies of the repository you're working in, then this article is for you.

An example repo

Let's first create an example repo. Here's a little terminal session where I create a repository, add a README to it on the master branch, add another line to the readme on a branch called other, and finally checking out to master.

[~] $ mkdir repo
[~] $ cd repo
[repo] $ git init
Initialized empty Git repository in /home/slarse/repo/.git/
[repo] $ echo "Hello!" > README.md
[repo] $ git add README.md && git commit -m 'Add README'
[master (root-commit) 6094baf] Add README
 1 file changed, 1 insertion(+)
 create mode 100644 README.md
(master)[repo] $ git checkout -b other
Switched to a new branch 'other'
(other)[repo] $ echo "There!" >> README.md 
(other)[repo] $ git commit -am 'Add new line to README'
[other b779dfb] Add new line to README
 1 file changed, 1 insertion(+)
(other)[repo] $ git checkout master
Switched to branch 'master'
(master)[repo] $

It's not super important how you do it, just make sure to have two branches.

Adding a new worktree

First of all: what is a worktree? Usually, you only have the worktree, which is the part of a repository where you actually do your work (edit files etc). Running git worktree list on most repos will show the location of this single worktree, and what commit/branch it is checked out to.

(master)[repo] $ pwd # just checking the current working directory
/home/slarse/repo
(master)[repo] $ git worktree list
/home/slarse/repo  6094baf [master]  # points to the cwd, checked out to master

Note: When I run git worktree list after this point, it's just to show the results of commands.

With git worktree add, you can add additional worktrees checked out to different commits. The most basic usage is git worktree add <path> <commit-ish>, where path is a path to the new worktree (i.e. where you want to put it), and commit-ish is something like a commit or branch (or a few other things that are not important for every-day use). Let's check out other in a new worktree.

(master)[repo] $ git worktree add ../repo-other other
Preparing worktree (checking out 'other')
HEAD is now at b779dfb Add new line to README
(master)[repo] $ git worktree list
/home/slarse/repo        6094baf [master]
/home/slarse/repo-other  b779dfb [other]
(master)[repo] $ ls -a ../repo-other # have a look in the new working tree
.  ..  .git  README.md

As you can see, the new worktree has been created, and can be seen in the list of worktrees. .git is usually a directory, but in the case of a non-primary worktree, it's actually just a file with a path to the original .git directory.

(master)[repo] $ cat ../repo-other/.git 
gitdir: /home/slarse/repo/.git/worktrees/repo-other

Like many things in Git, it's brilliantly simple. You can start working in your new worktree like it's an entirely separate repository, with the caveat that you can't check out to a branch that is checked out in some other worktree. That includes checking out to other commits or branches, and even creating entirely new branches.

Moving a worktree

If for some reason you need to move a worktree, you should use git worktree move to make sure that all of the references are correctly changed. It's very simple, just type git worktree move <src> <dst>. For example, if I want to move ../repo-other to ../repo-work, I do:

(master)[repo] $ git worktree move ../repo-other ../repo-work
(master)[repo] $ git worktree list
/home/slarse/repo       6094baf [master]
/home/slarse/repo-work  b779dfb [other]

That's all there is to moving worktrees. Not very exciting, and I can't recall ever actually doing it, but I can see how it could be useful.

Removing a worktree

To remove a worktree, run git worktree remove <path>.

(master)[repo] $ git worktree remove ../repo-work/
(master)[repo] $ git worktree list
/home/slarse/repo  6094baf [master]

You can also just remove the directory with the worktree and the reference to it will be removed automatically (but not necessarily immediately). Run git worktree prune to trigger this removal process.

The other worktree commands

There are a few more git worktree commands that I've never felt the need to use. Have a look at them in the git-worktree documentation.

Summary

In this short article I showcased git worktree. It's super useful to work in parallel on different versions of the same project, without having to create copies of the repository and thereby having to deal with synchronizing multiple local copies (which can quickly get hard to manage). I find myself using this more and more, and if you find it useful yourself I highly recommend reading up on it more in its man-page (either with man git-worktree or online).

Redirecting stdout and stderr in bash

2019-06-23T21:37:00+02:00

A couple of weeks ago I covered some basic I/O redirection in bash (see I/O redirection in bash). Well, there's actually a lot more to it, so for this TOTW I thought I'd touch on a few more advanced usages.

Redirecting stderr

Sometimes, you may find that part or all of the output of a command isn't properly redirected. As a quick example, navigate to any directory that is not a Git repository, and run git status. You should see something like this:

$ git status
fatal: not a git repository (or any of the parent directories): .git

Yet, if you try to redirect it with a standard redirect, the output is still displayed, and the file you redirect to remains empty.

$ git status > output
fatal: not a git repository (or any of the parent directories): .git
$ cat output
$

The reason is quite simple: the output from git status is an error message, which is typically output on standard error (stderr), while I/O redirection operates on standard output (stdout) by default. When redirecting output (or input, for that matter), one can optionally provide a file descriptor specifying which output stream to redirect. On a typical UNIX-like system, stdout is file descriptor 1, and stderr is file descriptor 2. So if we want to catch that stderr output, we just need to prepend a 2 to the redirection operator.

$ git status 2> output
$ cat output
fatal: not a git repository (or any of the parent directories): .git

You can probably guess that if you leave the file descriptor out, it will default to 1. In some cases, you may want to redirect both stderr and stdout to the same file. But many programs output both on stderr and stdout, and we may want to redirect both of them.

Redirecting stderr and stdout

So, we can specify a file descriptor to redirect stdout or stderr (or any other file descriptor, really), but many programs output on both stderr and stdout, and it's often useful to redirect both. Here's a small Python script print.py that outputs on line on stdout and one on stderr.

import sys

print("some standard output", file=sys.stdout)
print("some error output", file=sys.stderr)

Note: That's Python as in Python 3.

If we redirect stdout only, then the stderr line is still printed to the terminal.

$ python3 print.py 1> stdout_output # recall that the 1 can be omitted
some error output
$ cat stdout_output
some standard output

Similarly, redirecting only stderr leaves the stdout output on the terminal.

$ python3 print.py 2> stderr_output
some standard output
$ cat stderr_output
some error output

Quite intuitively, if we want to redirect both stderr and stdout to one file each, we can simply do two redirections following one another.

$ python3 print.py 1> stdout_output 2> stderr_output
$ cat stdout_output
some standard output
$ cat stderr_output
some error output

There's also the possibility to redirect both stdout and stderr to the same file using the special & character in place of a file descriptor.

$ python3 print.py &> output
$ cat output
some standard output
some error output

And with that and the previous article, I've shared pretty much everything I feel is useful with output redirection. In some future Tip of the Week, I'm sure I'll get into input redirecton as well, as it's much the same.

Technical e-books from Humble Bundle

2019-06-13T09:12:00+02:00

Another Tip of the Week, in the same week as the previous one (because I've been slacking off). This one is very simple, and rather non-technical. I simply want to direct attention toward humblebundle.com, and their quite frequent book bundles. Many of these book bundles include or are entirely centered around programming topics. I've gotten tons of great books from there over the years, and it can really pay off to keep an eye on the new bundles. Typically, the top tier of a book bundle is around 15-20 EUR, while most books in it cost more alone. There have been general programming bundles, hacking/security bundles, Python bundles (a lot of them), Linux bundles, and much more. A few of the best books I've gotten from Humble include:

The Linux Programming Interface
- Got it in a 20 EUR bundle, while this book alone costs somewhere around 70.
Fluent Python
- I actually already owned this book at the time of acquiring it from a bundle, but it's by far the best book on Python I've ever read.
Flask Web Development
CSS: The Definitive Guide

And many, many more! Really, all I want to get said with this TOTW is that you should really keep an eye on Humble Bundle, as their book bundles contain tremendous value, and you can choose to put some (or all) of the money you pay toward charity. And that's all there is to it!

Disclaimer: I am not sponsored by Humble in any way, I am simply a long-time user of the service.

I/O redirection in bash

2019-06-11T23:16:00+02:00

Alright, so Tip of the Week has turned somewhat into "tip every two or three weeks". It turns out that it's pretty difficult to find the time to actually write something every week. but I'll keep trying. With that out of the way, let's head into the subject matter of this post: I/O redirection. We'll just have a look at the most basic but also most generally applicable use of redirection: taking the output from a program and storing it in a file.

Important: Files will both be created and clobbered in this TOTW. When trying this stuff out, first create a new directory and do everything in there, so you don't litter your filesystem with strange files, or accidentally overwrite something important.

Redirecting output

To set the stage, I'll be working in a directory with the following contents:

[tmp] $ ls
file1.txt  file2.txt  image1.png file2.txt

Redirecting output is fairly simple, and useful when you want to save the output of some command in a file. There are two primary ways of redirecting output: appending and truncating. Appending is the one I use the most, so let's start with that one.

Appending output redirection

With >>, we can make an appending redirect.

[tmp] $ ls >> ls_output.txt  # output from ls saved to output.txt
[tmp] $ cat ls_output.txt    # let's have a look... 
file1.txt
file2.txt
image1.png
image2.png
ls_output.txt
[tmp] $ ls >> ls_output.txt  # append new output
[tmp] $ cat ls_output.txt    # let's have a look again
file1.txt
file2.txt
image1.png
image2.png
ls_output.txt
file1.txt
file2.txt
image1.png
image2.png
ls_output.txt

There are three things to note here. First, the ls_output.txt file does not exist in the initial directory, and so it is created with the first redirect. Note however that ls_output.txt is present in the first redirected output from ls: ls_output.txt is actually created before ls is run as there needs to be an open file descriptor* to the file pass along.

** * ** A file descriptor can simply be thought of as a pointer to a file. There is no need to understand file descriptors intimately to use basic I/O redirection efficiently.

The second redirect is then appended to the file, which at that point already exists. And that pretty much sums up how an appending redirect functions: it appends output to the specified file if it exists, and creates a file with the output if it does not exist. I find that this is most often the functionality that I want, but in some cases, you want to re-create the file from scratch with each redirect. That can be achieved with a truncating redirect.

Note: You may note that the output of ls is formatted differently when output to the terminal, and when redirected to a file. ls checks whether the stdout file descriptor points to a terminal, or something else, and formats the output accordingly. The details are somewhat out of scope.

Truncating output redirection

Let's assume that we start over from the initial state of the directory, before ls_output.txt existed. We can then make a truncating redirect with >.

[tmp] $ rm ls_output.txt    # restore initial directory state
[tmp] $ ls > ls_output.txt  # make a truncating redirect
[tmp] $ cat ls_output.txt   # and inspect the results
file1.txt
file2.txt
image1.png
image2.png
ls_output.txt
[tmp] $ ls > ls_output.txt  # another truncating redirect
[tmp] $ cat ls_output.txt
file1.txt
file2.txt
image1.png
image2.png
ls_output.txt

If you did not know what truncating meant before, you can probably figure it out now. With a single >, the specified file is created if it does not exist, just like with >>, but it is entirely overwritten (truncated, clobbered) if it already does exist. I rarely use a truncating redirect, as it is an easy thing to accidentally truncate a file you did not mean to touch. I recommend to always use an appending redirect, unless you have a good reason to truncate the targeted file.

And that's it for this TOTW!

Piping commands in bash

2019-05-21T00:00:00+02:00

Many, many bash commands are built around and meant to be used with a fundamental feature of the bash shell (actually, most shells), called piping. Put simply, piping takes the output of one command and provides it as input to the next. Here's a simple example of running ls and filtering the result with grep to find all .py files in the current directory.

$ ls # just run ls 
file1.md  file2.md  file3.md  script1.py  script2.py
$ ls | grep '\.py$'
script1.py
script2.py

To be precise, the | (pipe) operator takes the output from the command on the left, and provides it as input to the command on the right. Pipes can be chained practically as much as you'd like. For example, if we want to get amount of .py files in the current directory, we can pipe the output from grep to the wc (word count) command, with the -l option to count lines only.

$ ls | grep '\.py$' | wc -l
2

wc counts two lines, which is precisely the amount of .py files that we found. Let's move on to I/O redirection. Piping allows you to easily compose powerful programs from simple commands, and is a very intuitive way to work. Next week, I'll cover I/O redirection, which is another super useful feature of bash that's a bit more complicated.

Using bash aliases

2019-05-06T12:19:00+02:00

For this Tip of the Week, I'd like to present something that took me a while to figure out why it was useful. That something is bash aliases, and I'll now walk you through how to create aliases, and the two main ways in which I use them (although I'm sure there are more use cases).

Using aliases

I think the bash manpage has a very good and concise description of what an alias is:

Aliases allow a string to be substituted for a word when it is used as the first word of a simple command

In other words, I can define a command that is substituted for some other command. Creating an alias is very simple. The syntax looks like this:

alias <NAME>=<COMMAND>

So for example, if I want to have a command hellofile that creates a file with the text "Hello, world!", I can achieve that with the following alias.

$ alias hellofile='echo "Hello, world!" > hellofile.txt'

Note the single quotes around the command definition. Without them, bash would interpret the alias as being only echo, and the rest of the line as another command. Now, if I run the command hellofile, it fill be substituted with echo "Hello, world!" > hellofile.txt. You should think of aliases as pure text substitution: precisely what you put in the alias definition will be put on the command line when you invoke it. You can view all of your current aliases by running alias without any options. Now, let's have a look at some common use cases!

Specifying "default" options for commands

This is probably the most common use case for aliases, and it's likely that you already have some in play. A common one is to have ls aliased to ls --color=auto. That is to say, the following alias is defined:

$ alias ls='ls --color=auto'

So if I now run e.g. ls /etc, the resulting command is actually ls --color=auto /etc. Note how the alias does not have to be the only word I type for the command, it just has to be the first one. Another command that I use an alias for is xclip, which is a small utility for copying stuff. I use it almost exclusively to copy file contents to the clipboard, but that's not the default functionality. In order to copy to the clipboard, I must write this rather cumbersome command.

$ xclip -selection clipboard <FILEPATH>

So I have an alias for it so I can just type xclip <FILEPATH> to copy to the clipboard.

$ alias xclip='xclip -selection clipboard'

As a side note, it may not be the best style to clobber an existing command with an alias, but I still tend to do that for some of my most commonly used commands. If you want to use the vanilla command, simply put it within single quotes, which will hinder the alias from expanding (e.g. type 'ls' to run ls without --color=auto). Note that just defining an alias in a bash session will not persist: it needs to be defined anew for each session. To have it permanently defined, put the definition in a startup script (e.g. .bashrc or .bash_profile).

Creating throwaway commands

Now, the aliases I described above are useful to have defined permanently, and should be defined in a startup script. The second use case I have for aliases is when I have a repetitive command that I need to type over and over in the same session, but isn't useful in general. An example would be when I need to run some specific Java class in a project. Let's say I need to run the class se.slar.awesome.project.Main over and over. Instead of typing java se.slar.awesome.project.Main over and over, I define an alias for it.

$ alias runmain='java se.slar.awesome.project.Main'

And then, instead of writing all of that out, or having to do some reverse searching or history lookups, I can just type runmain. As defining an alias is so effortless, I tend to do it even if I know I'm just gonna use the complex command a couple of times.

And that's all I wanted to cover, hope you enjoyed it and stay tuned for the next TOTW coming next week!

Git local

2019-04-29T22:58:00+02:00

Nowadays, Git is almost ubiquitous in software development. Most developers also know that Git is a decentralized version control system, meaning that every copy of the repository carries the full revision history, and there is no "central" repository. A consequence of the decentralized aspect of Git is that you can create repositories locally, and version control documents in them locally, without ever setting up a remote repository on e.g. GitHub or GitLab. In this TOTW, I'll show you how to use Git locally, and also how to change your mind and put it on e.g. GitHub at a later time.

Note: This also touches on an important and often misunderstood point: Git and GitHub are not the same thing. Git is a version control system, while GitHub is a service which allows hosting of remote repositories, issue management etc. GitHub is also not the only service around, GitLab and BitBucket are two other prominent services which host Git repositories.

Using Git locally

How do you use Git locally, then? It's simple. Just create a directory and run git init to initialize it as a Git repository. Here's an example command line session of what it looks like.

[~] $ mkdir repo
[~] $ cd repo
[repo] $ ls -a
. ..            # repo is empty
[repo] $ git init
Initialized empty Git repository in /home/slarse/repo/.git/
[repo] $ ls -a
.  ..  .git     # the .git directory indicates that this is now a Git repo

I often use Git to version control stuff that I have no intention of ever putting up in a remote repository. This is useful for when you accidentally remove stuff, or just need to try out a bunch of different ideas that you can swap back and forth between by simply switching branches.

Changing your mind (also called adding a remote)

If you suddenly feel like that local repo should be put up on a hosting service after all, maybe just to back it up, or maybe to collaborate with someone else, it's very simple to do so. First, create an empty repository (as in completely empty, don't initialize it with a README or license). Then copy the address to the repository (I prefer to use SSH). Let's say I have a repo at git@github.com:slarse/superrepo.git. I can then add it as a remote to my local repo, and push my master branch to it.

[repo] $ git remote add origin git@github.com:slarse/superrepo.git
[repo] $ git branch
* master  # I'm on the master branch, which is what I want to push
[repo] $ git push --set-upstream origin master
Enumerating objects: 3, done.
Counting objects: 100% (3/3), done.
Writing objects: 100% (3/3), 213 bytes | 213.00 KiB/s, done.
Total 3 (delta 0), reused 0 (delta 0)
To github.com:slarse/superrepo.git
 * [new branch]      master -> master
Branch 'master' set up to track remote branch 'master' from 'origin'.

Now my previously local-only repo is also in GitHub, and I can push and pull from it as usual. That's all for this tip of the week, it's just meant to spark an idea that took me quite a while to come up with myself!

History and history expansion in bash

2019-04-22T11:59:00+02:00

Admittedly, this TOTW is one day late, so this week there will be 2xTOTW! In any case, the tip I want to bring up here is very much related to last week's TOTW on Reverse search in bash. Sometimes, reverse searching just doesn't work out. You may not be quite sure what you are looking for, or there are just too many recent commands that look samey. In such cases, using the history command is a good alternative.

`history`

The history command will display the last commands that you have entered, and looks something like this:

$ history
 1009  fg
 1010  git status
 1011  git commit -a -m 'Add module docstring to github_api module'
 [***OUTPUT TRUNCATED***]
 2007  history

Each command is called an event, and the output is formatted as <event_nr> <event>. Precisely how many commands are returned by the history is determined by the HISTSIZE and HISTFILESIZE environment variables. Setting these to something like 5000 and 10000, respectively, should be manageable even for the weakest of computers. You can also limit the output of history by providing an integer argument, so e.g. history 5 will display the last 5 commands. Now, the real power of history becomes apparent when using it with history expansion.

History expansion

History expansion can be used to expand an event number into the whole command it corresponds to. To expand an event, one simply types !<event_nr>. For example, looking at the history output above I can see that event number 1010 corresponds to git status. I can execute the command again with history expansion like so:

$ !1010
git status         # Command is echoed
On branch master   # Output from executing the command
[***REST OF OUTPUT OMITTED***]

The command is first echoed, and then executed. There are a few other ways to specify the event number.

!: Execute the last event.
- I.e. type !! in the terminal.
- Can be useful to re-execute a command that you realized you needed sudo for with sudo !!.
-n: Execute the nth previous event.
- E.g. type !-1 to execute the last event, !-2 to execute the one before that, and so on.
- I personally don't find this very useful.

There is one more very useful feature that I often use, and that is the ability to only print the command. This can be achieved by appending :p to the history expansion command. Here is an example:

$ !1011:p
git commit -a -m 'Add module docstring to github_api module'

The command can then be accessed by pressing UP-arrow or ctrl-p, which is very useful if you need to do minor modifications to it. There are tons of more ways to use history expansion, and I strongly recommend reading the man-page on it. Type man bash and then search for HISTORY EXPANSION, or do the same in this online bash man page.

Filtering history

A final tip on using history expansion is to filter the output with grep. For example, if I only want to find commands that include the word git, I can filter the output of history by piping to grep with the | character.

$ history | grep git
 1010  git status
 1011  git commit -a -m 'Add module docstring to github_api module'

I will most likely do another TOTW on piping, but the basic principle is that | takes the output from the command on the left and feeds it as input to the command on the right. That's it for this TOTW, stay tuned for the next one coming on Sunday the 28th of April!

Reverse search in bash

2019-04-09T23:23:00+02:00

Have you ever found yourself furiously tapping the UP-arrow (or ctrl+p) to find a command that's probably waaaay up there? Would you be surprised if I told you there's a better way? When you want to re-use a command you've written previously, and you know it's not the previous command, or the one before that, your first resort should be a reverse search. This can be accessed with ctrl+r. If you press that button combination, you should see something like this:

(reverse-i-search)`':

Just start typing the beginning of the command you're looking for, and most often, it will pop up. For example, I sometimes need to re-run the previousgit command that I ran a while back. I then press ctrl+r and type git to get something like this:

(reverse-i-search)`git': git push

Note how the initial git before the : is what I've actually written here, and the text after the : (in this case git push) is what's been found with the reverse search. Pressing tab now will terminate the search and put the result of the search on the command line for editing. Then, simply press enter to execute the command as usual. You can also skip over the editing part and press enter right away to execute the command as-is. Sometimes, however, the result you get first isn't what you want (obviously, just typing git push would have been faster in this case). You can then press ctrl+r again to cycle to the next hit.

(reverse-i-search)`git': git commit -a -m 'Add module docstring to github_api module'

Now there's a command that I might not want to have to type out again in its entirety, better showing why a reverse search may be useful. That's it for this week's TotW, check back next week for more!

Announcing Tip of the Week (TotW)

2019-04-09T23:15:00+02:00

In order to actually get around to writing some content, I've decided to start a little series: Tip of the Week! Every week, I'll spend 30 minutes or so writing a very small article about some tip related to programming, Linux or technology in general. And no, this one does not count, so I still have to write this week's TotW!

Migrating my blog

2019-04-06T18:32:00+02:00

Edit: The migration is complete! The Flask-based site has been retired and this Pelican-based site is fully fleshed out with the old content :)

I'm currently in the midst of migrating my old blog over here. Until I'm done, both sites will be a bit half-baked, sorry about that!

Testing tips: Tests that don't test

2019-03-05T22:07:56+00:00

Unit testing is a skill that takes some time to develop, and there are numerous pitfalls for the beginner. As I've done my fair share of unit testing, and taught a lot of students what I know, I've decided to share my top tips of things to think about when testing. First up is one that may seem obvious, but beginners and experienced testers alike fail with on occasion: make sure you are actually testing something.

Tests that don't test

Quite often, I find tests written by students that don't actually test anything, and will pass regardless of what the student's code is doing. Sometimes, I find tests written by yours truly that are similarly ineffective. A test that passes when it should not is dangerous, because it makes you feel confident about code that isn't properly tested. On the flip side, a test that fails when it should not is annoying and may hamper productivity, but unlike a falsely positive test, it is highly noticeable. The devious part of tests that don't test is that they easily slip by unnoticed, you don't often investigate a test that passes! These tests generally come in four flavors:

Not calling the function under test.
Copy mistakes with references/pointers.
Mistakes during setup.
Mistakes with assertions.

Even though I have a few years worth of testing experience, and have written thousands upon thousands of tests, I still make these mistakes from time to time. Let's first go over them one by one to get a feel for what can go wrong. After that, I'll share my techniques for catching these errors. For all of the examples, we will look at a test case for sorting a randomly ordered list with an in-place sorting algorithm. The implementation under test is called mysort. Assume that, for all examples, a list called random_list with randomly ordered elements is setup in a fixture. The tests will be written in pytest syntax, but most problems and solutions are easily transferable to many other languages and testing frameworks (e.g. JUnit in Java). Here is the test header and docstring. Note the inclusion of the random_list fixture as a parameter. In the test, it can simply be used as a list.

def test_sort_randomly_ordered_list(random_list):
    """Sort a randomly ordered list and ensure that the result for
    ``mysort`` is the same as the built-in ``list.sort``
    """

For brevity, the docstring will be excluded from now on. Let's get to it the, shall we?

Not calling the function under test

This mistake definitely sits in the top two most common ones that I encounter. A typical example of this is when using redundant computation to produce a test oracle. That is, using some other implementation of the function under test to compute the expected result. What I've seen happen many times is that the student by mistake uses the other implementation for both the expected value, and the actual value. Here's an example.

def test_sort_randomly_ordered_list(random_list):
    # calculate test oracle
    expected = list(random_list) # note the copy for later!
    expected.sort()

    # calculate actual value, use ``sort`` by mistake
    # should be ``mysort(random_list)``
    random_list.sort()

    assert random_list == expected

Obviously, this test will always pass as list.sort is used for both computations. This is a very common mistake, and if made once in a test suite, I often find it propagating elsewhere due to copy-paste errors. This kind of mistake is applicable in most any language, and is especially easy to make if the redundant function and the function under test have similar names and usage (which was actually not the case here!).

Copy mistakes with references/pointers

Another very common issue that is often related to redundant computation is failing to make a proper copy of a data structure. If you have a look at the previous example, there is comment telling you to note the copy. Compare that with this example:

def test_sort_randomly_ordered_list(random_list):
    # calculate test oracle
    expected = random_list # this is not a copy!
    expected.sort()

    # calculate actual value
    mysort(random_list)

    assert random_list == expected

Just assigning expected = random_list will not create a copy of random_list, but copy the reference to the list. Therefore, both expected and random_list reference the same list. The assertion is then semantically equivalent to assert random_list == random_list, which is obviously true no matter what mysort did with the list. This is a problem in any language that uses references (not C++ references, but pointer-like references), such as Java and Python, or when dealing with pointers in pretty much any language that has them.

Mistakes during setup

This is also fairly common, and can manifest in a variety of ways. The general idea is that the setup is performed such that the outcome of the test is very likely to be the same even if the production code is anything but correct. One example would be that the supposedly randomly ordered list is actually comprised of duplicates of a single element. Let's have a look at an incorrect implementation of the random_list fixture. Note that _ is used as a variable name when we don't care about the value of it.

@pytest.fixture
def random_list():
    """Generate a randomly ordered list with 100 elements."""
    lst = []
    for _ in range(100):
        random.seed(5234) # seed to make list generation deterministic
        lst.append(random.randint(-100, 100))
    return lst

It is good practice to seed the pseudo-random generator (PRG) when testing to make tests reproducible. A PRG is actually a deterministic function that, given an initial state (a seed), will always produce the same sequence of numbers. random.seed(5234) sets this initial state to 5234. This fixture is actually fairly well implemented, but has a critical error. Since the seed is set inside the loop, before the call to random.randint, the latter will always produce the same value. As the list is already sorted, mysort can do almost anything but remove an element and still pass the test. This is a fairly sophisticated error that an intermediate tester may accidentally make. There are infinite variations on how setup may go wrong, and this is applicable to pretty much any programming language. As a side note, the correct way to do this would of course be to seed before the loop. Note that even with the correct configuration, there is a very small chance that the random elements are generated in ascending order.

Mistakes with assertions

The final issue is also common, and comes in many shapes and forms. One thing I sometimes see is that the assertions are tautologies, such as assert random_list == random_list (obviously true), and probably mostly result from typos and unchecked auto-completion. Another common one is that assertions are simply missing, and is most often found in tests that are large enough that a missing line or two is not immediately apparent.

Finding tests that don't test

There are essentially two ways I know of to find tests that (pretty much) never fail.

Write the tests first (Test-driven development)
Inject errors into production code and expect tests to fail

Test-driven development (TDD)

TDD involves writing the test cases before you implement the functionality. You first write the test cases, ensure that the test cases fail, and then implement the production code such that the tests pas. I typically use TDD when:

The functionality I need to implement is strictly defined.
- Fox example when implementing well-defined algorithms and data structures.
I'm fixing a bug.
- Reproduce the bug with a test-case, then fix it!

This approach will catch many incarnations of the errors I've brought up in this article simply because the tests should definitely not pass before the production code is even written. There is one caveat, though. Some practitioners of TDD think that test cases should be written even before the function skeletons have been written, and argue that a compilation failure is also a test failure. With that approach, you probably will not catch any of the errors brought up here, except maybe the first one. My recommendation for TDD is to write function skeletons and make sure the function can actually be called (it's perfectly fine if it crashes after being called). Then write your tests, and make sure they fail before you start implementing production code. I don't think TDD is always practical to use, however, especially when I'm a bit unsure of what to do and need to experiment with different APIs. That's when the second technique comes in real handy.

Inject errors into production code and expect tests to fail

This is a highly useful technique that can always be performed, and I do this almost every time I implement tests after production code. The idea is simply to consider what your test is testing, and inject errors into the production code such that the test should fail. test_sort_randomly_ordered_list is a fairly broad test case, so we can inject fairly general errors. A simple example would simply be to return early such that mysort does not sort at all. Narrower test cases may require more sophisticated errors to be injected.

Aside: Mutation testing There is actually a whole field of testing dedicated to this kind of error (or fault) injection called mutation testing. Faults are automatically injected into production code, and the test suite is run to determine whether the fault is found (killed) or not. There are frameworks for this, such as the Pitest for Java, and Cosmic Ray for Python. In general, it takes a long time to run mutation testing on a test suite, as often the whole test suite needs to be run for a single fault. And there are many, many possible faults.

Summary

While I framed this as a unit testing article, these concepts are applicable to most kinds of testing. You should always attempt to make sure that your test is doing what it claims to be doing. A single typo may be what stands between a test that does not test, and a test that does. This article focused on finding tests that don't test, but there are also things you can do to prevent tests that don't test from manifesting. Copy/pasting test code and then making minor changes is for example a common source of most of the discussed errors. But ultimately, there is no surefire way of avoiding tests that don't test, so I strongly recommend that you actively search for them no matter what precautions you take!

TornadoFX+Exposed pt. 3: Adding, editing and removing rows

2018-12-30T14:50:50+00:00

Welcome to the third and final part in this article series on using TornadoFX together with Exposed. In the previous two parts, we set up the database with a single table and created a simple TornadoFX view with which we could view its contents. Now, we will focus on adding and deleting rows to the Categories table, as well as adding new ones. This part is a bit longer than the two previous ones, but it also contains a whole lot more content.

The full source code is available on GitHub

Article index

Project and database setup
Showing a database table
Adding, editing and removing rows -- This part!

Making the app interactive

So far, all we can do with our app is view the contents of the database. That's neat and all, but it would be even nicer if we could interact with the database and edit its contents. What this article will address is how to:

Delete rows.
Add rows.
Edit rows.

Deleting rows is the simplest thing to accomplish, so let's start with that.

Deleting rows

Deleting a row is pretty easy. First, we'll add the desired functionality to the controller.

fun deleteCategory(model: CategoryModel) {
    transaction {
        model.item.delete()
    }
    categories.remove(model)
}

Note that model.item returns the backing Category object, on which we simply call delete() to remove it from the database. Then, we also have to update our local list by removing the model from it. Note that I assume model to be in the categories list for the sake of simplicity, but this is a pretty bold assumption that you probably should not make in a real application. Now, let's put this new functionality to work: we need to add a button to the view that calls the delete function on the currently selected row. We will slightly alter the layout to make this happen. We change this:

override val root = borderpane {
    categories = dbController.categories

    center = tableview<CategoryModel> {
        categoryTable = editModel
        items = categories

        column("Name", CategoryModel::name)
        column("Description", CategoryModel::description)
    }
}

to this:

override val root = borderpane {
    categories = dbController.categories

    center = vbox {
        buttonbar {
            button("DELETE SELECTED") {
                action {
                    val model = categoryTable.tableView.selectedItem
                    when (model) {
                        null -> return@action
                        else -> dbController.deleteCategory(model)
                    }
                }
            }
        }
        tableview<CategoryModel> {
            categoryTable = editModel
            items = categories

            column("Name", CategoryModel::name)
            column("Description", CategoryModel::description)
        }
    }
}

We use a buttonbar as we will be adding more buttons later on. The code should be fairly easy to read: the button's action will do nothing if the currently selected model is null (i.e. nothing is selected), and call the deleteCategory method otherwise. You should now have a view looking something like this:

If you first click a row and then the delete button, the row should disappear. Now that we can delete rows, let's turn our attention to adding new rows.

Adding new rows

For this, we're going to add a small form to the right of the table which will allow us to enter new rows. As before, we'll start with the controller, adding the following method to it:

fun addCategory(name: String, description: String) {
    transaction {
        val category = Category.new {
            this.name = name
            this.description = description
        }
        categories.add(
            CategoryModel().apply {
                item = category
            })
    }
}

Here, we first create a new Category, and then add it to the categories list (wrapped in a CategoryModel). Now, we need to add the form to the view so we can submit the values for name and description. First, we need to add two new properties to the CategoryEditor view:

var nameField: TextField by singleAssign()
var descriptionField: TextField by singleAssign()

We need these to be able to access what we put in the form fields. We also need to import TextField

import javafx.scene.control.TextField

To add the actual form, we put the following after the center element:

right = form {
    fieldset {
        field("Name") {
            textfield {
                nameField = this
            }
        }
    }
    fieldset {
        field("Description") {
            textfield {
                descriptionField = this
            }
        }
    }
    button("ADD CATEGORY") {
        action {
            dbController.addCategory(nameField.text, descriptionField.text)
            nameField.text = ""
            descriptionField.text = ""
        }
    }
}

This will result in a view looking something like this:

Writing some stuff in the fields and clicking ADD CATEGORY should immediately create a new row in the table. Not the most beautiful thing in the world, I'll admit, but it serves its purpose for this guide. Now we only have one more feature to add, namely editing rows.

Editing rows

Now we will finally see why we used a TableViewEditModel instead of a plain TableView: the former allows us to edit rows directly in the table. To allow for inline editing, we need to add some stuff to the view itself. Our table view currently looks like this:

tableview<CategoryModel> {
    categoryTable = editModel
    items = categories

    column("Name", CategoryModel::name)
    column("Description", CategoryModel::description)
}

To enable editing, we simply add a call to enableCellEditing(), and call makeEditable() on the columns. We'll also add enableDirtyTracking() to allow us to see which cells have been edited, but not saved.

tableview<CategoryModel> {
    categoryTable = editModel
    items = categories

    enableCellEditing()
    enableDirtyTracking()

    column("Name", CategoryModel::name).makeEditable()
    column("Description", CategoryModel::description).makeEditable()
}

Now, we can edit cells by clicking them:

And after pressing enter, we can see that the cell has been edited by the blue triangle. The cell is dirty:

However, the change won't "stick". If we restart the application, the text will be back to what it was before we edited the cell. The reason is that the change was never committed to the database, it was just stored in the model. Thus, what we need now is to commit any dirty rows to the database. As always, we start with adding the functionality we need from the controller.

fun commitDirty(modelDirtyMappings: Sequence<Map.Entry<CategoryModel, TableColumnDirtyState<CategoryModel>>>) {
    transaction {
        modelDirtyMappings.filter { it.value.isDirty }.forEach {
            it.key.commit()     // commit value to database
            it.value.commit()   // clear dirty state
        }
    }
}

This function iterates over a sequence of map entries that map a model (key) to a dirty state (value). We'll soon see that we can get such a map from the table view. Note that committing the key must be done in a transaction, as it will write to the database. The type is a bit of a mouthful, though, so let's define a type alias for it.

typealias ModelToDirtyState = Map.Entry<CategoryModel, TableColumnDirtyState<CategoryModel>>

Note that the typealias must be a top level declaration (i.e. you can't put it in a class or function). And rewrite the header of commitDirty like this:

fun commitDirty(modelDirtyMappings: Sequence<ModelToDirtyState>)

Slightly more readable, right? Now, let's put it to use. We'll add a new button in the button bar to execute the commit.

button("COMMIT") {
    action {
        dbController.commitDirty(categoryTable.items.asSequence())
    }
}

Clicking this button when there are dirty cells will allow us to commit these to the database. As a finishing touch, we'll add a button to reset (rollback) dirty cells to their previous state.

button("ROLLBACK") {
    action {
        categoryTable.rollback()
    }
}

Note that this does not require a transaction, as all that happens is that the model state is reset (the DAO is unaffected). This will leave us with a final GUI looking like this:

Closing words

That was all for this series of articles on TornadoFX and exposed. This is by no means a fully-fledged database UI, but it is a pretty good start. There are tons of things here that need to be improved, though. Below are a few examples off the top of my head.

There is just about no error handling, everything is just assumed to work out. For example, if a user enters a duplicate category, an unhandled exception is raised.
Much of the functionality is very specific to the Category type, and needs to be generalized. As a lot of this is done with generics, such generalization is actually not trivial (as generic types are invariant by default).
There is a lot of room for user error. For example, deleting a row is done without prompting the user with something like "Are you sure you wanna do this?". The commit/rollback functionality of editing is much more user friendly and a step in the right direction.
The views are completely unstyled and look rather dull.

And with that, I wish you good fortune in working with this! Of course, you are free to use all of these examples as you see fit.

TornadoFX+Exposed pt. 2: Showing a database table

2018-12-26T09:09:47+00:00

Welcome to the second part of the TornadoFX+Exposed series of articles. In this part, we'll take a look at how to create a TornadoFX view for the Categories table. In the next part, we'll expand upon the view and make it possible to add, edit and delete rows.

The full source code is available on GitHub

Article index

Project and database setup
Showing a database table -- This part!
Adding, editing and removing rows

Creating a table view

To be able to view the Categories table, we're going to need three things:

A view model to wrap Category instances. We can actually get away without having a model, but having a model greatly simplifies some of the operations we will implement in the next article.
A controller for interacting with the database
A view for displaying the data.

We will do all of this in a new file called categoryview.kt. Let's start with the view model, as it is by far the simplest component.

An ItemViewModel wrapper for Category

For this, we'll extend a utility class called ItemViewModel (you can read about it in detail in the TornadoFX guide. It will simply look like this:

import tornadofx.*

class CategoryModel : ItemViewModel<Category>() {
    val name = bind(Category::name)
    val description = bind(Category::description)
}

This is essentially a proxy for the Category class, acting as a middle-man between the presentation layer and the database access layer. Any change we make to a CategoryModel in the GUI will be stored in the model alone, and will only propagate to the underlying Category object when we commit the change(s). This is very convenient, as it allows us to buffer changes and then commit all of them in a single database transaction, instead of having one transaction per change. Now, let's move on to the controller.

The database controller

The controller is also fairly simple. Initially, it will only be able to fetch items from the database. In the next article, we will extend the controller with add and delete-functionality. Here's the initial version of the controller:

import javafx.collections.ObservableList
import java.sql.Connection
import org.jetbrains.exposed.sql.Database
import org.jetbrains.exposed.sql.transactions.transaction
import org.jetbrains.exposed.sql.transactions.TransactionManager

class DBController : Controller() {
    val categories: ObservableList<CategoryModel> by lazy {
        transaction {
            Category.all().map {
                CategoryModel().apply {
                    item = it
                }
            }.observable()
        }
    }

    init {
        Database.connect("jdbc:sqlite:file:data.sqlite", driver = "org.sqlite.JDBC")
        TransactionManager.manager.defaultIsolationLevel = Connection.TRANSACTION_SERIALIZABLE
    }
}

The categories property is lazily initialized to a fetch from the database, in which all Category DAOs are wrapped in CategoryModels. There's a bit of a trade-off here: it's more efficient to fetch the whole table only once and then maintain the state with any objects that are added or init contains precisely the same database connection setup that we used in the first article. Let's move on to the actual view.

The table view

For the table view, we're going to use a TableViewEditModel instead of a plain TableView. The reason is that the TableViewEditModel has some additional functionality, most notably the ability to edit rows directly in the table. Again, you can read up on the details in the TornadoFX guide. Our initial attempt looks like this:

class CategoryEditor : View("Categories") {
    val dbController: DBController by inject()
    var categoryTable: TableViewEditModel<CategoryModel> by singleAssign()
    var categories: ObservableList<CategoryModel> by singleAssign()

    override val root = borderpane {
        categories = dbController.categories

        center = tableview<CategoryModel> {
            categoryTable = editModel
            items = categories

            column("Name", CategoryModel::name)
            column("Description", CategoryModel::description)
        }
    }
}

There's not too much going on here. The three properties store references to the controller, the table view, and the list of categories. The view itself is not very eventful either, we simply fetch the categories using the controller and initialize the table view. Note that editModel and items are properties of the TableViewEditModel, where the former is a reference to the table and the latter the property containing the items of the table (which we set to the categories observable list). Later, when we wish to update the table, we simply work with the categories list. Don't worry that there are some unused references here, we will put them to use in the next article.

Creating a runnable app

Now, we just need to make the app runnable. That's as simple as adding the following:

class Kuizzy : App(CategoryEditor::class)


fun main(args: Array<String>) {
    launch<Kuizzy>(args)
}

Running the main method will start the app, and you should then see a view that looks something like this:

That's pretty much it for this part. In the next and final part, we'll look into how to add, delete and edit rows of the Categories table. You can find part 3 here.

TornadoFX+Exposed pt. 1: Project and database setup

2018-12-25T22:42:36+00:00

I recently got it into my head that I'd like to make a quiz game with a GUI, which felt like a simple enough diversion during the holidays. Since I already have this site to maintain in terms of web development, I figured that desktop app development in Kotlin using the TornadoFX framework would be a nice change of pace. Kuizzy, which is what I call the project, will obviously need some kind of data storage for questions and the like, so I settled on usind JetBrains' framework Exposed with a sqlite database. Starting out, I had trouble figuring out how to use TornadoFX and Exposed together, and therefore decided to write this three-part series of articles on how I managed to make it work. I won't dive deep into either, but rather show by example how to perform some elementary tasks. In the end, we'll have a small piece of the database admin part of Kuizzy up and running.

The full source code is available on GitHub

Article index

Project and database setup -- This part!
Showing a database table
Adding, editing and removing

Getting started

In this first article, we will be concerned only with getting everything set up. This includes getting the dependencies and setting up the database with a table. I will assume that you know how to use Kotlin, and how to handle dependencies (e.g. by using a build system like Gradle, or just doing it manually). These are things that I will not explain, as there are plenty of resources for that available elsewhere. It's also important to note that these articles are not meant to be seen as the way to do this. In order to keep the articles reasonably focused and short, I take tons of shortcuts, completely eschew error handling and create very specialized functionality. The point of this article series is to show you how to get started with TornadoFX+Exposed, and you are meant to develop it further on your own.

What are we aiming for?

I think it helps tremendously when reading something to have the end goal in sight. What we're shooting for here is an interface that looks something like this:

We will be able to create and delete rows, as well as edit rows directly in the table. It's not pretty and it's not very user friendly, but it conveys an idea and has all the basic functionality required do administrate a single-table database. Now that you have a rough idea of what we're trying to accomplish, let's have a look at what libraries and tools we need to make it happen.

Preliminaries

Before we can get started, we need to make sure all dependencies are accounted for. Here's a complete list of the libraries and frameworks I'll be using throughout this series:

Java 8 and openjfx 8
- Note that if you install Oracle's JDK, JavaFX is included. You only need openjfx if you use openjdk.
Kotlin 1.3.0
- 1.2+ should work fine
TornadoFX 1.7.17
Exposed 0.11.2
xerial-sqlite-jdbc 3.25.2
- I'll be using sqlite, but using any other SQL database supported by Exposed only requires changing a line or two of code.
Gradle 5.0
- Any reasonably up-to-date version should work. You can also just use whatever way you see fit to handle the dependencies.

Here's my build.gradle:

plugins {
    id 'org.jetbrains.kotlin.jvm' version '1.3.0'
}

group 'se.slarse'
version '0.0.1'

repositories {
    mavenCentral()
    jcenter()
}

dependencies {
    compile 'org.jetbrains.kotlin:kotlin-stdlib-jdk8'
    compile 'no.tornado:tornadofx:1.7.17'
    compile 'org.jetbrains.exposed:exposed:0.11.2'
    compile 'org.xerial:sqlite-jdbc:3.25.2'
}

compileKotlin {
    kotlinOptions.jvmTarget = "1.8"
}

compileTestKotlin {
    kotlinOptions.jvmTarget = "1.8"
}

Setting up the database

Any quiz game worth it's salt has categories, and that's the part of the database that we'll develop. Exposed allows us to interact with a SQL database in two different ways: through the SQL Domain Specific Language (DSL), or through the Data Access Object (DAO) pattern. I'll use the DAO, as I thought it meshed nicely with TornadoFX. You can read about both of them on the Exposed GitHub page. We will put all of the database code in a file called database.kt. Let's first define the table for categories.

import org.jetbrains.exposed.dao.*
import org.jetbrains.exposed.sql.*
import org.jetbrains.exposed.sql.transactions.TransactionManager
import org.jetbrains.exposed.sql.transactions.transaction
import java.sql.Connection

object Categories : IntIdTable() {
    val name = varchar("name", 64).uniqueIndex()
    val description = varchar("description", 128)
}

Note that we don't explicitly define the primary key, that's all handled in the background. Along with the table (which is a singleton object), we also need to represent rows.

class Category(id: EntityID<Int>) : IntEntity(id) {
    companion object : IntEntityClass<Category>(Categories)

    var name by Categories.name
    var description by Categories.description

    override fun toString(): String {
        return "Category(name=\"$name\", description=\"$description\")"
    }
}

The Category class is what we'll use to create DAOs, with which we can create, modify and delete rows in the Categories table. Finally, let's create the table and add some rows to it. We won't actually touch the Categories object directly at all.

fun main(args: Array<String>) {
    // "connect" to database file called data.sqlite in the current working directory
    // (creates the file if it does not exist)
    Database.connect("jdbc:sqlite:file:data.sqlite", driver = "org.sqlite.JDBC")
    // this isolation level is required for sqlite, may not be applicable to other DBMS
    TransactionManager.manager.defaultIsolationLevel = Connection.TRANSACTION_SERIALIZABLE

    transaction {
        addLogger(StdOutSqlLogger)
        // create the table
        SchemaUtils.create(Categories)

        // add some entries
        Category.new {
            name = "java"
            description = "The Java programming language"
        }

        Category.new {
            name = "cpp"
            description = "The C++ programming language"
        }
    }

    // new transaction to check the results
    transaction {
        Category.all().forEach { println(it) }
    }
}

Note how all interactions with the database are conducted inside of a transaction (which is a function taking a lambda, that abstracts a database transaction). You'll see this several times throughout these articles. That's it for database setup! If you run main function, you'll get output that looks something like this:

SQL: CREATE TABLE IF NOT EXISTS Categories (id INTEGER PRIMARY KEY, name VARCHAR(64) NOT NULL, description VARCHAR(128) NOT NULL)
SQL: CREATE UNIQUE INDEX Categories_name ON Categories (name)
SQL: INSERT INTO Categories (description, name) VALUES ('The Java programming language', 'java')
SQL: INSERT INTO Categories (description, name) VALUES ('The C++ programming language', 'cpp')
Category(name="java", description="The Java programming language")
Category(name="cpp", description="The C++ programming language")

If you run the main function again, it will fail the unique constraint on the name attribute and crash. And that's it for part 1. In the next part, we'll look at how to create a read-only view of the Categories table. You can find part 2 here.

Collapsing and expanding HTML elements using (mostly) CSS

2018-11-14T17:57:34+00:00

Sections that collapse and expand at the click of a button is fairly ubiquitous across the web nowadays. It's especially handy for mobile, where the display is much smaller than your typical computer monitor. In this article, I'll walk you through how to create a basic collapsible content-area using almost only CSS, along with a few lines of close-to-trivial JavaScript. The focus is on CSS, not JavaScript, so you should be able to follow this even with the most rudimentary programming experience.

The fictional sidebar

For this toy example, we will be creating a collapsible div (it could really be just about any element) that can be collapsed and expanded by clicking a "trigger". Just for the purpose of showing the effects more clearly, we'll do it inside of another div element, which we'll imagine is a sidebar of a website (like the sidebar with recent posts and tags on this site). Here's the markup for the sidebar:

<div class="sidebar">
    <!- our content goes in here! -->
</div>

And the CSS:

.sidebar {
  width: 30%;
  border: black solid 2px;
}

This is really not important for this demo, I just mention it so that you don't wonder about some unknown HTML and CSS in the final demo. Now, let's fill that sidebar up with some content.

<div class="sidebar">
  <div id="trigger">Cool content heading</div>
  <div id="content">
    <p>
      This content would be neat to hide and show at the click of a button!
    </p>
  </div>
</div>

We have 3 div tags in total: one is the container (sidebar) which really has little to do with this article. The second is a heading for the content, which will act as the trigger for showing and hiding the content. The third is the one containing the content that we want to hide/show (a single paragraph). Let's get to it! Note the id attributes on the two inner div tags. When I later refer to the #trigger, I mean the div with id="trigger", and likewise for the #content. The id attributes serve no other purpose here, you can remove them if you wish and everything will still work as expected.

Collapsing and expanding the `div`

To collapse and expand #content, we will use two classes: collapse-trigger and collapse. The basic idea is this:

An element with the collapse is hidden by default.
If a collapse element follows an element with the collapse-trigger AND the active classes, the collapse is visible.

You can probably guess where to put the classes in the markup already:

<div class="sidebar">
  <div id="trigger" class="collapse-trigger">Cool content heading</div>
  <div id="content" class="collapsible">
    <p>
      This content would be neat to hide and show at the click of a button!
    </p>
  </div>
</div>

For the CSS, fulfilling point 1 above (collapsible hidden by default) is simple:

.collapsible {
    display: none;
}

This will simply not display the element. But how do we fulfill the second requirement? We can use the adjacent sibling combinator (+). It's a selector combinator that allows us to match some element, only if it is immediately preceded by some other element. For example, the selector h1 + p will match any p tag that is immediately preceded by an h1 tag:

<h1>This is a heading</h1>
<p>This paragraph will be matched by "h1 + p"</p>

So, to show our collapsible when it is directly preceded by a collapse-trigger AND active element, we do this:

.collapse-trigger.active + .collapsible {
    display: block;
}

We display it as a block here, but one could use other display modes as well, depending on what visual effect is sought. Note that we are chaining two classes in the left hand side of the + combinator, which means that an element matches only if it's class attribute contains both of those classes (and possibly more of them). The order of the classes is however not important, i.e. .active.collapse-trigger would be equivalent.

That's actually all there is to it, as far as the CSS goes. Now, we can collapse and expand the #content by opening the developer tools (F12 in Firefox and Chrome) and manually assigning the active class to #trigger. But that's not very convenient in every day use. This is where we need the tiniest bit of JavaScript to be able to toggle active.

Toggling the `active` class with the click of a button

What we want to do is to remove and add the active class from any collapse-trigger element by clicking it. Here, we need JavaScript, because there is no way to change the class of an element with only CSS. For every collapse-trigger in the page, we need to attach an event listener that toggles the active class every time the element is clicked. It can be done like this:

function attachCollapseTriggers() {
    var colTriggers = document.getElementsByClassName("collapse-trigger");
    for (var colTrig of colTriggers) {
        colTrig.addEventListener("click",  function() {
            this.classList.toggle("active");
        });
    }
}

Essentially, we use the getElementsByClassName DOM method to find all elements with the collapse-trigger class. Then, we iterate over those elements, and add an event listener to it. The first argument to addEventListener is an event (in this case a button click). The second argument is a callback function, for which we provide an anonymous function. If you have a hard time wrapping your head around anonymous functions, this will accomplish the same thing:

function collapseTrigger() {
    this.classList.toggle("active");
}

function attachCollapseTriggers() {
    var colTriggers = document.getElementsByClassName("collapse-trigger");
    for (var colTrig of colTriggers) {
        colTrig.addEventListener("click", collapseTrigger);
    }
}

When the element is clicked, the collapseTrigger function is called. Of course, we need to call attachCollapseTriggers sometime after the page has loaded for this to take effect. And that's it for the JavaScript, clicking the #trigger will now cause #content to collapse and expand! However, it's not very clear to the user that the #trigger even can be clicked. Let's make that just a little bit more clear by adding some visual cues.

Finishing touches using the `::after` pseudo class

A typical visual cue that a drop down can be expanded is a down-triangle (▼). An up-triangle (▲) is as recognizable a cue that a menu can be collapsed. The down-triangle should be appended to any collapse-trigger that is not active, while the up-triangle should be appended to any collapse-trigger that also has the active class. We can do that simply use the ::after pseudo class.

.collapse-trigger::after {
    content: "▼";
    float: right; /* float to the right-hand side of the content box */
}

.collapse-trigger.active::after {
    content: "▲";
}

So, what happens here, exactly? When a collapse-trigger is not active, it doesn't match the .collapse-trigger.active selector, so the content will simply be the down-triangle. When a collapse-trigger is active, it will match both selectors. However, .collapse-trigger.active is more specific than .collapse-trigger, so it wins out, and the content will be an up-triangle. And that's it, all done!

Code listing and JSFiddle link

The full code is available in the following subsections, and you can find a JSFiddle here.

Markup

<!- the outer div with the class "sidebar" isn't important, it's just any container -->
<div class="sidebar">
  <div class="collapse-trigger">Cool content heading</div>
  <div class="collapsible">
    <p>
      This content would be neat to hide and show at the click of a button!
    </p>
  </div>
</div>

CSS

/* the sidebar class is just an arbitrary container for this example */
.sidebar {
  width: 30%;
  padding: 1em;
  border: black solid 2px;
}

.collapse-trigger::after {
    content: "▼";
    float: right;
}

.collapse-trigger.active::after {
    content: "▲";
}

.collapsible {
    display: none;
}

.collapse-trigger.active + .collapsible {
    display: block;
}

JavaScript

function attachCollapseTriggers() {
    var colTriggers = document.getElementsByClassName("collapse-trigger");
    for (var colTrig of colTriggers) {
        colTrig.addEventListener("click",  function() {
            this.classList.toggle("active");
        });
    }
}

attachCollapseTriggers();

A binary search tree in Kotlin pt. 2: Generic node

2018-10-30T08:26:05+00:00

Welcome to part 2 of my series on the idiomatic Kotlin binary tree! In this part, we're gonna have a look at how to make the node representation from part 1 capable of carrying any kind of data (i.e. generic).

Series index

Representing a node
Generic node (this part!)
Generic BST with insert, contains and traversal (coming soon!)

Attribution and reading recommendations

In this part, we'll start working a little bit with binary tree algorithms. More specifically, we'll complete the contains function from part 1. All of the algorithms I implement in this series are based on this article by Stefan Nilsson. If you are unfamiliar with the concepts of binary trees, I highly recommend that you sift through that article before continuing with this one.

Improving `contains`

Recall the contains function that we started working on in part 1. Before we start working on generics, I want us to complete this function. It will make some of the decisions about generics much more apparent. Anyway, here's contains as we wrote it in part 1.

// check if data is contained in node
fun contains(node: ANode, data: Int): Boolean = when (node) {
    is Empty -> false
    is Node -> data == node.data // note implicit cast
}

It's not a particularly useful function, as it only checks the current node. What we'd rather have it do is check the entire subtree, in which node is the root. This can quite easily be performed with recursion. The Empty case still stands, if we hit an empty node, the data is not contained in the tree. If node is a Node, on the other hand, there are three possibilities:

data < node.data, in which case we keep searching in the left subtree.
data > node.data, in which case we keep searching in the right subtree.
data is neither smaller or larger than node.data, so they are comparably equal. If this actually means that they are equal or not is implementation specific, but it is highly recommended that ordering is consistent with equals. We will assume that this is the case.

As we have three distinct cases, we can again use a when expression.

// check if data is contained in the tree rooted in node
fun contains(node: ANode, data: Int): Boolean = when (node) {
    is Empty -> false
    is Node -> when { // no-argument when so we can do arbitrary comparisons
        data < node.data -> contains(node.left, data)
        data > node.data -> contains(node.right, data)
        else -> true
    }
}

Note that the nested when expression has no argument in parentheses, allowing us to perform more complex operations in the matchings. And that's it for the contains operation. It's really quite elegant. We can quickly ammend the main function to try it out:

fun main(args: Array<String>) {
    // create the tree
    //          6
    //         / \
    //        3   9
    //         \
    //          4
    val root = Node(6,
            left = Node(3,
                    right = Node(4)),
            right = Node(9))

    println("Search for elements in the tree")
    for (data in listOf(6, 3, 4, 9)) {
        println(contains(root, data))
    }
    println("Search for elements not in the tree")
    for (data in listOf(10, 2, 1, -12)) {
        println(contains(root, data))
    }
}

With that out of the way, let's dive into making the whole thing generic!

Getting generic

Now, how do we make the node classes generic? A first attempt might be to just change the Node class, and do something like this:

data class Node<T>(val data: T, var left: ANode = Empty, var right: ANode = Empty) : ANode()

fun <T : Comparable<T>> contains(node: ANode, data: T): Boolean = when (node) {
    is Empty -> false
    is Node<T> -> when {
        data < node.data -> contains(node.left, data)
        data > node.data -> contains(node.right, data)
        else -> true
    }
}

This is a reasonable attempt. Before we dive into the problems, let's analyze what we just did. After fun we write <T : Comparable<T>>, which makes this function generic. The addition states that the type parameter T can be substituted by any type that implements Comparable<T> (which we need to be able to use < and >). Node<T> simply defines a type parameter T that can be substituted with any type. It is a bit inconvenient that we could put non-comparable types in a Node object, but we'll see that it sorts itself out when we create the Tree class in part 3. For now, just ignore that detail.

Now, the above code won't compile, for multiple reasons. The first problem is that we can't ask at runtime if node is Node<T> (the compiler will say Cannot check for erased type: Node), becuase information about generics is erased at runtime. We could succesfully match against a wildcard type parameter (meaning any type) with is Node<*>, but then we run into the real showstopper: we don't know whether data and node.data are actually comparable, as they might not have the same type. With the current class hierarchy, there is no reasonable way around this. An ANode is not parameterized, and therefore the dynamic type of an ANode can be any Node type (e.g. Node<Int>, Node<String> etc) or Empty. We have to put the type parameter T higher up in the inheritance chain.

Inheriting from a generic class

Since ANode is the only class higher up in the inheritance chain (apart from Any), this is where we need to put our type parameter. For ANode and Node, it is straightforward.

sealed class ANode<T>

data class Node<T>(val data: T, var left: ANode<T> = Empty, var right: ANode<T> = Empty) : 
        ANode<T>()

Note that Node<T> is inheriting from ANode<T>. We cannot (and wouldn't want to, anyway) leave it as ANode, because unlike Java, Kotlin does not support raw types. Now, since we cannot inherit from ANode, but must specify the type parameter with a concrete type, what do we put there for Empty? In the case of Node<T>, we simply inherit from ANode<T>, because T is declared as a parameter to Node<T> and is therefore concrete for for ANode. We can't however just do something like

object Empty : ANode<T>()

because T is not declared in that scope. We also can't give Empty a type parameter, because Empty is a singleton object, and type parameters simply don't work with singletons (it wouldn't make much sense, if you stop to think about it for a while). What we actually want to put as the type parameter, is nothing. Literally. We wan't Nothing, a concrete type in Kotlin which is a subtype of every non-nullable type.

object Empty : ANode<Nothing>()

Note that we don't want to put Any (which is the supertype of every non-nullable type), because the we wouldn't be able to assign Empty to any concrete type of ANode<T>. Now you may be angrily shouting that you still can't assign Empty to any type of ANode<T>. Unfortunately, ANode<T> (for any substition of T) is invariant. Let's fix that.

Generics are invariant by default, but Kotlin can stretch the rules

Any generc class is invariant by default. What does this mean? In short, it means that a generic type (e.g. ANode<Int>) is not a supertype, nor subtype, of any other type. Formally, it means that if we have two types A and B such that B is a subtype of A, ANode<B> is not a subtype of ANode<A>. Take for example Empty, which subtypes ANode<Nothing>. It is not a subtype of ANode<Int>, even though Nothing is a subtype of Int. Inconvenient, we want Empty to be a subtype of any concrete ANode. We can achieve this by using the out modifier, and declaring Anode<out T>. Formally, we make ANode covariant on the type parameter T. We can only do this if every use of T is in an out position (i.e. return values). Note that this restriction applies only to the body of ANode, the T in Node<T> is not the same type parameter as in ANode<out T>. If you found all of that confusing (I sure did the first time I read about it), you can read more about variance in the Kotlin docs on generics. Here is what the working class hierarchy looks like:

sealed class ANode<out T>

object Empty : ANode<Nothing>()

data class Node<T>(
        val data: T,
        var left: ANode<T> = Empty,
            var right: ANode<T> = Empty) : ANode<T>()

contains now looks like this:

// check if data is contained in node
fun <T : Comparable<T>> contains(node: ANode<T>, data: T): Boolean = when (node) {
    is Empty -> false
    is Node<T> -> when {
        data < node.data -> contains(node.left, data)
        data > node.data -> contains(node.right, data)
        else -> true
    }
}

Now, you may be asking yourself why in the world we can do the is Node<T> check now, while we could not before? Well, we're not actually checking at runtime whether node is Node<T>, because the compiler knows any variable with the static type ANode<T> is either Empty, or Node<T>. So, for example, ANode<Int> must have dynamic type Empty or Node<Int>, there are no other possibilities. As the compiler knows this, we can in fact skip the <T> and just write is Node. That's all we need of our node classes, so we can move on to implement the Tree class in part 3!

Final code listing

This is the final state of the code that we'll be using for part 3.

sealed class ANode<out T>

object Empty : ANode<Nothing>()

data class Node<T>(
        val data: T,
        var left: ANode<T> = Empty,
        var right: ANode<T> = Empty) : ANode<T>()

// check if data is contained in node
fun <T : Comparable<T>> contains(node: ANode<T>, data: T): Boolean = when (node) {
    is Empty -> false
    is Node<T> -> when {
        data < node.data -> contains(node.left, data)
        data > node.data -> contains(node.right, data)
        else -> true
    }
}

fun main(args: Array<String>) {
    // create the tree
    //          6
    //         / \
    //        3   9
    //         \
    //          4
    val root = Node(6,
            left = Node(3,
                    right = Node(4)),
            right = Node(9))

    println("Search for elements in the tree")
    for (data in listOf(6, 3, 4, 9)) {
        println(contains(root, data))
    }
    println("Search for elements not in the tree")
    for (data in listOf(10, 2, 1, -12)) {
        println(contains(root, data))
    }
}

A binary search tree in Kotlin pt. 1: Representing a node

2018-10-28T15:13:08+00:00

In my journey to become a somewhat competent Kotlin developer, I've decided to implement a few of the basic data structures that I've picked up during my three years of computer science studies. First up, we have a generic binary tree. This is an interesting case, because it lets us both delve into generics in Kotlin, and some aspects of inheritance that differ from inheritance in Java (in a good way!). As I want to cover some topics in depth, this will be a three-part series with the following content:

In the first part, we'll develop a basic class hierarchy for representing nodes. It covers object types, data classes and sealed classes, as well as some other related topics. The node is however restricted to only carry Int data.
In the second part, we'll expand upon the class hierarchy from part 1 to create a generic node class that can hold any type of data.
Finally, we'll use the results of part 2 to develop a rudimentary binary tree in (what is in my opinion) idiomatic Kotlin.

If you already know about object types, data classes and sealed classes, I recommend that you skip directly to part 2. If you are already comfortable with generics, including generic inheritance, you may skip directly to part 3.

Series index

Representing a node (this part!)
Generic node
Generic BST with insert, contains and traversal (coming soon!)

Goals and intended audience

I write articles mostly for myself, and as such, this article series is intended for developers with some experience with Java looking to get into Kotlin. Let's get at it then, shall we?

Representing a node: A Java-like attempt

As I see it, a tree node can be one of two things: existent, or non-existent. In other words, it can be a node or an empty node. As Kotlin is, thankfully, quite adverse to using null, I will refrain from doing so as well. So what we want is an abstract node class ANode and sub-classes Node and Empty. Let's give it a first try in a pretty Java-like manner, and then improve upon it with some neat Kotlin language constructs.

abstract class ANode

class Empty : ANode()

class Node : ANode {
    val data: Int
    var left: ANode
    var right: ANode

    constructor(data: Int) : super() {
        this.data = data;
        right = Empty()
        left = Empty()
    }
}

If you've had any experience with any remotely Java-looking language, you can probably guess what's going on here. There's the abstract ANode class, the Empty class representing the absence of a node and the Node class representing an actual node. Note also that we have not delved into generics yet, this is a node that can only hold Int data. That's fine for now, we'll expand upon this implementation with generics in part 2. When we later implement the binary tree, we will often want to distinguish between a Node and Empty. One such case is when we search the tree for a given value, to see if it is contained in the tree. This operation can be succinctly expressed using recursion, but let us leave that for part 2. For now, let's just check the first node (the root), without exploring its children.

// check if data is contained in node
fun contains(node: ANode, data: Int): Boolean = when (node) {
    is Empty -> false
    is Node -> data == node.data // note implicit cast
    else -> throw IllegalArgumentException("node argument was neither Empty nor Node!")
}

This is of course a pretty stupid function at this point, but we'll make it much more worthwhile in part 2. Note the expression body used here, in combination with a when expression. If you are unfamiliar with those concepts, follow the links and read up on them, they will be crucial when implementing the tree algorithms in parts 2 and 3. Also note the implicit cast occurring on the second line of the function. Since we used is Node to match node, the compiler can infer that node is in fact a Node object, and we can safely dereference it with node.data! Finally, note that the else case is needed as the compiler does not know that there are only two subclasses of ANode (even though we currently do, in this very small project). We'll see how to resolve that shortly. Let's try this function out:

>>> contains(Empty(), 2)
false
>>> contains(Node(2), 2)
true
>>> contains(Node(2), 3)
false

It seems to work just fine. We could leave the class hierarchy like this and jump straight into generics. There are, however, three notable problems with the node classes.

For each empty node we need, a new instance of Empty is created. This is wasteful.
The body of Node is a whole lot of code for very little functionality.
The compiler can't tell that Node and Empty are the only subtypes of ANode, forcing us to use an else in the when expression.

As it turns out, all of these problems are easy to solve in Kotlin!

Problem 1 solution: Singleton objects

Problem number 1 can be solved very easily, as Kotlin has language support for the singleton pattern. We simply swap this declaration

class Empty : ANode()

for this declaration

object Empty : ANode()

Empty is now a singleton object, so we can assign it without instantiating Emptys all over the place. The constructor for Node now looks like this:

constructor(data: Int) : super() {
    this.data = data;
    right = Empty       // note the lack of parentheses!
    left = Empty
}

One problem solved, two to go!

Problem 2 solution: Primary constructors and data classes

We can solve problem number 2 with Kotlin's syntax for primary constructors. Instead of defining Node the Java way, we do it the Kotlin way:

class Node(val data: Int, var right: ANode = Empty, var left: ANode = Empty) : ANode()

This is almost equivalent to the previous declaration, with the exception that right and left are assigned default values in the header such that they can be replaced by explicit arguments when calling the constructor. Note that ANode must be instantiated right there in the header as well. However, since we know that Node will always be a simple container, we can do one better here by prepending data to the declaration.

data class Node(val data: Int, var right: ANode = Empty, var left: ANode = Empty) : ANode()

This makes Node a data class, which among other things come with implementations of equals and toString. A fortunate accident here is that the toString of Node will actually let us view the whole tree with very little effort, as toString will be called on both left and right, recursively (this will be demonstrated in part 3). Do be careful not to create a cycle, though, as this will cause a stack overflow, endlessly calling toString (a tree, by definition, has no cycles, so we are good in this case).

Problem 3 solution: Sealed classes

To reiterate, the problem was that the compiler can't tell that Node and Empty are the only subtypes of ANode. Therefore, we needed the else in the when expression to cover up the non-existent case of the argument to contains being anything else.

// check if data is contained in node
fun contains(node: ANode, data: Int): Boolean = when (node) {
    is Empty -> false
    is Node -> data == node.data
    else -> throw IllegalArgumentException("node argument was neither Empty nor Node!")
}

We can, however, tell the compiler that Node and Empty are the only subtypes by making ANode a sealed class. Any subclass of a sealed class must be declared inside the same file, which lets the compiler know precisely which subtypes can exist. To accomplish this, we simply replace the abstract modifier with sealed (because sealed implies abstract, we don't need the latter).

sealed class ANode

We can now drop the else from contains, because the compiler knows that a variable with static type ANode is either Empty, or a Node, there are no other possibilities.

// check if data is contained in node
fun contains(node: ANode, data: Int): Boolean = when (node) {
    is Empty -> false
    is Node -> data == node.data
}

Let's give it a try

>>> contains(Node(2), 2)
true
>>> contains(Empty, 2)
false

Neat, now we have a good base for venturing into the fraught land of generics in part 2.

Final code listing

The final version of the code, that we'll use in part 2, can be found below. I've also included a main function such that you can run the code in your preferred way, right off the bat!

sealed class ANode

object Empty : ANode()

data class Node(val data: Int, var left: ANode = Empty, var right: ANode = Empty) : ANode()

// check if data is contained in node
fun contains(node: ANode, data: Int): Boolean = when (node) {
    is Empty -> false
    is Node -> data == node.data // note implicit cast
}

fun main(args: Array<String>) {
    println(contains(Empty, 2))
    println(contains(Node(2), 2))
    println(contains(Node(2), 3))
}

Creating a standalone (runnable) Kotlin .jar file with IntelliJ and Gradle

2018-10-17T20:15:46+00:00

I've recently started dabbling in some Kotlin, and have found it a very pleasant experience. One of the first things I wanted to do was to create a standalone .jar file, including the Kotlin runtime and any other dependencies. This, as it turns out, was a bit tricky. In this short article, I will walk you through creating a small command line application using the awesome clikt library, and then packaging it into a standalone .jar.

Setting up

Start out with creating a new project by going to File -> New -> Project, select Gradle in the leftmost menu bar (i.e. not Kotlin), and then tick the Kotlin box in the Additional Libraries and Frameworks menu. Then just fill in any GroupId, ArtifactId and Version (I will use slarse, app and 0.1 for these fields, respectively). Then just click Next with the defaults until the project is created.

Initial Gradle configuration

In the project root, you should now have a file called build.gradle, which looks something like this:

plugins {
    id 'org.jetbrains.kotlin.jvm' version '1.2.51'
}

group 'slarse'
version '0.1'

repositories {
    mavenCentral()
}

dependencies {
    compile "org.jetbrains.kotlin:kotlin-stdlib-jdk8"
}

compileKotlin {
    kotlinOptions.jvmTarget = "1.8"
}
compileTestKotlin {
    kotlinOptions.jvmTarget = "1.8"
}

Before we can compile a project with clikt, we need to add it as a dependency. We can do that by adding compile "com.github.ajalt:clikt:1.5.0" in the dependencies section. It should now look like this:

dependencies {
    compile "org.jetbrains.kotlin:kotlin-stdlib-jdk8"
    compile "com.github.ajalt:clikt:1.5.0"
}

Then hit the little refresh symbol in the bottom left corner (should say Refresh Gradle Project when you hover your mouse over it) to download the new dependency. And that's it for now! We'll get back to the gradle.build file once we want to configure our jar task, but let's create the app first!

Creating the application

Let's make this easy: we'll just use the sample application available from the clikt documentation. It looks like this:

class Hello : CliktCommand() {
    val count: Int by option(help="Number of greetings").int().default(1)
    val name: String by option(help="The person to greet").prompt("Your name")

    override fun run() {
        for (i in 1..count) {
            echo("Hello $name!")
        }
    }
}

fun main(args: Array<String>) = Hello().main(args)

Create a Kotlin file called main.kt at src/main/kotlin/main.kt and paste the above code into it. Note that we are using the default package here (i.e. not defining a package) for the sake of simplicity.

For this to compile, we will need to add the following imports at the top:

import com.github.ajalt.clikt.core.CliktCommand
import com.github.ajalt.clikt.parameters.options.default
import com.github.ajalt.clikt.parameters.options.option
import com.github.ajalt.clikt.parameters.options.prompt
import com.github.ajalt.clikt.parameters.types.int

And that's it for the application, you should now be able to run it as usual. When running it, there should appear a prompt in the terminal saying Your name:. With that out of the way, the only thing left to do is to package our fantastic application into a standalone .jar file.

Packaging the application into a standalone `.jar` file

This is actually not very difficult, but you need to know what to do. We need to create a so-called "fat" jar, which includes both the Kotlin runtime and the clikt library. We also need to specify the name of our main class.

jar {
    manifest {
        attributes 'Main-Class': 'MainKt'
    }
    from {
        configurations.compile.collect { it.isDirectory() ? it : zipTree(it) }
    }
}

Note that the class file generated by Kotlin for a file called something.kt will be SomethingKt.class, which is why our main class is called MainKt. With that in mind, the manifest section is self-explanatory: we specify the main class. The from section collects all compile dependencies (that we specified in the dependencies section) and package them with the .jar file. The little piece of logic in the lambda is to properly add directories and .jar files, respectively (directories are just added, .jar files are unzipped and added).

Important: The main class file must be specified with its fully qualified name. For example, if I were to define main.kt in the package se.slarse, then I would need to put se.slarse.MainKt instead of just MainKt in the manifest.

Anyway, that's really all we need to do. It should now be possible to run the jar Gradle task to produce a .jar file in build/libs/<ArtifactId>-<Version> (so in my case it is at build/libs/app-0.1.jar). And that's it, hope it helped someone!

Full source code and `build.gradle`

Here are both of the files we wrote in this tutorial, in their entirety.

// main.kt
import com.github.ajalt.clikt.core.CliktCommand
import com.github.ajalt.clikt.parameters.options.default
import com.github.ajalt.clikt.parameters.options.option
import com.github.ajalt.clikt.parameters.options.prompt
import com.github.ajalt.clikt.parameters.types.int

class Hello : CliktCommand() {
    val count: Int by option(help="Number of greetings").int().default(1)
    val name: String by option(help="The person to greet").prompt("Your name")

    override fun run() {
        for (i in 1..count) {
            echo("Hello $name!")
        }
    }
}

fun main(args: Array<String>) = Hello().main(args)

// build.gradle
plugins {
    id 'org.jetbrains.kotlin.jvm' version '1.2.51'
}

group 'se.slarse'
version '0.1'

repositories {
    mavenCentral()
}

dependencies {
    compile "org.jetbrains.kotlin:kotlin-stdlib-jdk8"
    compile "com.github.ajalt:clikt:1.5.0"
}

compileKotlin {
    kotlinOptions.jvmTarget = "1.8"
}
compileTestKotlin {
    kotlinOptions.jvmTarget = "1.8"
}

jar {
    manifest {
        attributes 'Main-Class': 'MainKt'
    }
    from {
        configurations.compile.collect { it.isDirectory() ? it : zipTree(it) }
    }
}

Awesome Python Podcasts

2018-07-03T07:34:33+00:00

Whenever I find myself occupied with some monotonous task, I very much enjoy listening to podcasts. As programming is my number one passion, and Python is my favorite language, I tend to listen to podcasts that relate to them. In this post, I'll give a brief overview of my three favorite podcasts, and just why I enjoy them as much as I do!

Python Bytes

Python Bytes was the first podcast I ever listened to. It's a really neat show that comes out on a weekly basis, and focuses on delivering news and headlines in the Python community. The best part about the show is that they highlight awesome Python packages and tools that I would not have heard about otherwise. The episodes are fairly short, usually around the 20 minute mark, so they fit in even on a short commute. The episodes have, as far as I can recall, been published every week without fail for almost two years now, which is really nice. The hosts (Brian Okken and Micheal Kennedy) have great chemistry, and the show is recorded with decent enough equipment and well edited. All in all, I really enjoy the show and highly recommend it!

Talk Python To Me

Talk Python To Me is Micheal Kennedy's (from Python Bytes) own show. In each episode, Micheal invites someone (sometimes multiple people at once) from the Python community to come talk about what they do. The episodes are fairly lengthy and often reach for the 1 hour mark, but they are also mostly entertaining throughout. As with Python Bytes, the episodes are well edited, meaning that awkward pauses and the like shine with their absence. Talk Python To Me is probably my favorite podcast right now and I can't recommend it enough.

Podcast.init

_Podcast.__init___ is very similar to Talk Python To Me, seeing as both shows revolve around inviting prominent personalities to talk about their work. I haven't listened to all that many of the episodes, but I really enjoyed the first 7-8 episodes. Initially, the show had two hosts, but in the later episodes one of the original hosts is conspicuously abscent, which to me was to the detriment of the show. It's still a good show, mind you, but I enjoyed the early episodes more than the few late ones that I've listened to. The length of the episodes seem hover around the 1 hour mark, +/- some 20 minutes. I will probably revisit this post once I've listened to a few more of the episodes, but as it stands I recommend listening to the show from the beginning.

Did I miss something?

Those were my 3 top picks for Python podcasts. If you feel like I've missed some great podcast(s), feel free to drop a comment!

What the self? Python's self demystified!

2018-05-01T16:04:01+00:00

Any Python programmer will sooner or later want to (or have to) write a class. With classes come self, the seemingly (do note the emphasis there) magical keyword that you just have to write out as the first parameter to every method. To really understand classes, you need to understand what self actually is: neither magical, nor a keyword. Let's demystify this integral part of Python classes!

`self` is not a keyword

This is pretty easy to prove. Just open a Python interpreter and import the keyword module.

>>> keyword.iskeyword("for")
True                            # aha, makes sense
>>> keyword.iskeyword("else") 
True                            # seems to be working
>>> keyword.iskeyword("self")
False                           # proof!

Alternatively, one can always consult the list of keywords in the Python docs. Since self is not a keyword, it has no special significance in the language itself. We can also verify that it's not some funky builtin by simply typing it out in the interpreter.

>>> self
Traceback (most recent call last):
    File "<stdin>", line 1, in <module>
NameError: name 'self' is not defined
name 'self' is not defined

This can verified here). So, what in the world is self? Actually, it's just a variable name.

Trivia: If you try keyword.iskeyword("True") and keyword.iskeyword("False") in both Python2 and Python3, you will find that both are keywords in Python3, but not in Python2 (in 2, True and False are just builtin constants). In fact, True and False are not even write-protected in Python2, leading to shenanigans such as True, False = False, True being possible. In Python3, the keyword status of True and False make such an assignment a syntax error.

`self` is just a variable name

Consider the following code snippet of a (pretty useless) class that just stores two values (that are just assumed to be addable with each other), and defines a method that returns the sum of the values.

class Tuple:
    """A class for storing two values."""

    def __init__(self, first, second):
        self.first = first
        self.second = second

    def sum(self):
        """Return the sum of 'first` and 'second'."""
        return self.first + self.second

Okay, so the class is terrible, but that really doesn't matter for the purposes of this article. Now we see self in action for the first time. From the code, it's purpose is quite clear: it refers to the object instance on which the method is called. Usage looks something like this:

>>> t = Tuple(4, 3)
>>> t.first
4
>>> t.second
3
>>> t.sum()  # `self` in `sum` refers to `t`
7

So, self is just a reference to the object on which the method is called (in this case, t). This will probably become more apparent when reading Two ways to call methods further down, but just suspend your disbelief for moment and assume it is so. But, considering the self is just a variable to which t is assigned, what happens if we replace self with, say, donkey?

class Tuple:
    """A class for storing two values."""

    def __init__(donkey, first, second):
        donkey.first = first
        donkey.second = second

    def sum(donkey):
        """Return the sum of 'first' and 'second'."""
        return donkey.first + donkey.second

In fact, this will work exactly the same as when the first parameter was called self, and usage is unchanged:

>>> t = Tuple(4, 3)
>>> t.first
4
>>> t.second
3
>>> t.sum()
7

To summarize, the first parameter in a method is simply a variable that refers to the instance on which the method was called. Naming it self is just a convention , and we could name it anything. Note also that there is no technical need for consistency across methods, we could name the first parameter to the __init__ method donkey, and the first (only) parameter to the sum method shrek, and it'd still work.

class Tuple:
    """A class for storing two values."""

    def __init__(donkey, first, second):
        donkey.first = first
        donkey.second = second

    def sum(shrek):
        """Return the sum of 'first' and 'second'."""
        return shrek.first + shrek.second

IMPORTANT: Always name the first parameter to a method self. The convention exists for a reason: it makes your code more readable.

We could leave it at this, and hopefully walk away with a slightly better understanding of why we put self everywhere in methods. But I think diving just a little bit deeper into where the first argument to methods actually comes from will prove fruitful.

Two ways of calling methods

Methods are defined on the class itself, and not on the instance. There is actually a way to call a method directly on the class, that is equivalent to the way we usually call methods on the instance.

>>> t = Tuple(5, 6)
>>> t.sum()         # regular method call
11
>>> Tuple.sum(t)    # calling the method on the class itself, and passing
11                  # `t` as the `self` argument

The second way of calling sum is explicit about where self comes from: it's passed in as the first argument. If we think of the first, "regular" way of calling methods as shorthand for the second, it's suddenly entirely clear what the first argument actually is (the instance itself). Calling a method that has more parameters than just self works as expected: simply pass in the additional arguments. To be super clear, with the following method added to Tuple

def sum_mod(self, mod):
    """Return the sum of the members modulo 'mod'."""
    return (self.first + self.second) % mod

the following two method calls are equivalent

>>> t = Tuple(5, 7)
>>> t.sum_mod(5)
2
>>> Tuple.sum_mod(t, 5)
2

And that pretty much concludes what I wanted to cover in this article!

Summary

In the beginning, someone (probably Guido) said let there be self. And there was. The first parameter to a method is since then, by convention, called self, and refers to the object on which the method was called. Calling a method on some object t (e.g. t.sum()) can be viewed as shortand for calling the method on its class, and passing in a reference to t as the first argument (e.g. Tuple.sum(t)). If you are interested in learning about the dark magic going on behind the scenes, you can read up on the official documentation for the descriptor protocol, and more specifically the Functions and Methods part of it. It is however somewhat advanced, and I don't find it essential to understanding the semantics of method calls in Python. I hope you have found this article enlightening, stay tuned for more Python in the coming week!

Properties as Pythonic setters

2018-04-05T18:26:10+00:00

This is the second part in a two part series on Python properties. In Part 1 (which readers will be assumed to have at least skimmed through), we saw how a property can be used to create a read-only attribute that can be accessed like any data attribute (i.e with obj.attr), but raises an AttributeError when written to. Now, we will look at how to expand the property to also allow us to write to count like it's a normal data attribute (i.e. with t.count = 42), while also doing input validation.

A property as a Pythonic setter

Using the Ticker class version from the final listing in Part 1, we are unable to set the count attribute to any value.

>>> t = Ticker(24)  # valid range for count is thus [0, 23]
>>> t.count = 11    # this is well within that range
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
AttributeError: can't set attribute
can't set attribute

>>> for _ in range(11): # doing it the hard way ...
...     t.tick()
>>> t.count
11

This presents something of a usability issue, as the only way to set the Ticker's internal count to a specific value (using the public API) is by calling tick() an appropriate amount of times. If we were to use the Ticker as, say, a clock, we'd definitely want to be able to set count to a value within the range [0, _end) by simple assignment. Fortunately, there is a simple way to expand a property with a setter method using the @<name>.setter decorator, where <name> is replaced with the name of the property. For the count property of the Ticker class, it looks like this:

@count.setter
def count(self, val):
    """Set the internal count to val."""
    if val < 0 or val >= self._end:
        raise ValueError(f"{val} is out of range for attribute count.")
    self._count = val

Note: A string literal preceeded with an f is an f-string. This is a Python 3.6 feature. For backwards compatability, you could change to using string.format like this: "{} is out of range for attribute count.".format(val)

The code should be fairly self-explanatory. The setter takes a value val as an argument. If val is outside of the allowed range [0, _end), a ValueError is raised. Otherwise, _count is set to val. The error message could be more informative, but I did not want to obscure the important parts with a lot of text. We have thus defeated the aforementioned usability issue, and usage now looks like this:

>>> t = Ticker(24)
>>> t.count
0
>>> t.tick()
>>> t.count
1
>>> t.count = 11
>>> t.count
11
>>> t.tick()
>>> t.count
12
>>> t.count = 24
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "<stdin>", line 23, in count
ValueError: 24 is out of range for attribute count.
24 is out of range for attribute count.

Seems to work just the way we want it to!

Ticker full listing (with getter/setter property)

Here is the full listing of the Ticker class.

class Ticker:
    """A Ticker ticks from 0 to an upper limit, and then starts over."""

    def __init__(self, end: int):
        """Create a Ticker that starts over at end"""
        if end <= 0:
            raise ValueError("end must be greater than 0!")
        self._end = end
        self._count = 0

    def tick(self):
        """Increment the internal count by 1."""
        self._count = (self._count + 1) % self._end

    @property
    def count(self):
        """Return the current count."""
        return self._count

    @count.setter
    def count(self, val):
        """Set the internal count to val."""
        if val < 0 or val >= self._end:
            raise ValueError(f"{val} is out of range for attribute count.")
        self._count = val

Properties as Pythonic getters

2018-04-05T18:25:55+00:00

If you come from either Java or C++, you've probably written your fair share of getter and setter (also called accessor and mutator) methods. It is common for programmers that transition from such a language to Python to carry over this practice. In many cases in Python, we simply forego the abstraction and access the attributes directly. Sometimes, however, getters and setters are useful for providing write-protection and input validation. In this two-part series, we are going to explore how to make Pythonic setters and getters using one of my favorite Python features: properties.

Part 1 (this part): Properties as Pythonic getters

In this first part, we take a look at how to use a property to implement a read-only data attribute that can be accessed just like any other data attribute (e.g. like obj.attr). Writing to it will, however, result in an AttributeError. This is useful for preventing users from accidentally changing the internal state of an object in an unintended way, while still providing a uniform API. For example, we might want a way to access the root element of a binary tree, but without risking to alter its container.

Part 2: Properties as Pythonic setters

In the second part, we'll have a look at how we can use properties to also implement a setter method, with input validation, that can be utulized just like any plain ol' data attribute (e.g. like obj.attr = 42). This is useful when the attribute has some legal set of values.

The Ticker class

For the purpose of learning properties, we will develop a fairly useless class called Ticker. All it does is tick from 0 to some boundary, and then restart from 0. Two Ticker instances could, for example, represent a rudimentary clock with hour and minute counts. The first version of Ticker is outlined below.

class Ticker:
    """A Ticker ticks from 0 to an upper limit, and then starts over."""

    def __init__(self, end: int): # ': int' is an optional type hint
        """Create a Ticker that starts over at end"""
        if end <= 0:
            raise ValueError("end must be greater than 0!")
        self._end = end
        self.count = 0

    def tick(self):
        """Increment the internal count by 1."""
        self.count = (self.count + 1) % self._end

We can use this class something like this:

>>> t = Ticker(5)
>>> t.count
0
>>> t.tick()
>>> t.tick()
>>> t.count
2
>>> for _ in range(3):
...     t.tick()
>>> t.count
0
>>> t.count = 42    # uh oh...
>>> t.count
42                  # this is an illegal state
>>> t.tick()        # back to a legal state in the next tick
>>> t.count
3

As long as the count variable is only read from, there are no issues with this design. Unfortunately, directly assigning to count may put the Ticker in an illegal state, i.e. such that count is outside of its expected range of [0, _end). This isn't so much an issue for the Ticker itself, as it is returned to a legal state on the next tick. Other functionality depending on the Ticker to keep within the [0, _end) range could however be in for a nasty surprise, meaning that there is a serious usability issue here.

Thus to the crux:

How do we protect the count variable from being put in an illegal state, while still allowing access to it?

Solving the problem

First of all, we should make the count variable private (which in Python equates to prepending an underscore). The issue that remains to be resolved is how to expose _count in the public API of the class.

Solution 1: A Java-style getter

A Java or C++ programmer might instinctevly think of a traditional getter method.

def get_count(self):
    """Return the current count."""
    return self._count

This solution has two issues: it breaks the api, and it makes us think about count as something more complicated than the mere data attribute that it is. It would be much preferable if we could access _count just like we accessed it before it was made private (i.e. with t.count), but at the same time provide write protection (such that t.count = 42 raises an error). Enter the property.

Solution 2: Using a property as a read-only data attribute

Implementing the same functionality as get_count() with a property is dead simple.

@property
def count(self):
    """Return the current count."""
    return self._count

We use the @property decorator to say that the count method is a property. This will let us invoke the count method without providing the parens, so it will look like we are just accessing a data attribute named count. Usage now looks like below:

>>> t = Ticker(5)
>>> t.count
0
>>> t.tick()
>>> t.tick()
>>> t.count
2
>>> for _ in range(3):
...     t.tick()
>>> t.count
0
>>> t.count = 42
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
AttributeError: can't set attribute
can't set attribute

Excellent! We have the exact same API as when count was a public attribute, but without the risk of accidental overwriting. This is precisely what we wanted, and a Pythonic way of dealing with the issue of providing read access to fragile state variables.

Ticker full listing

It always annoys me when I get to the conclusion of some tutorial, and the end result is just assumed to be obvious. Therefore, here is the full listing of Ticker with a property as a getter.

class Ticker:
    """A Ticker ticks from 0 to an upper limit, and then starts over."""

    def __init__(self, end: int):
        """Create a Ticker that starts over at end"""
        if end <= 0:
            raise ValueError("end must be greater than 0!")
        self._end = end
        self._count = 0

    def tick(self):
        """Increment the internal count by 1."""
        self._count = (self._count + 1) % self._end    

    @property
    def count(self):
        """Return the current count."""
        return self._count

Now is about the time to move on to Part 2, in which we expand on the count property to allow us to set the internal count, but only within the range [0, _end)!

Programming for fun and profit

Book Review: Cybersecurity Myths and Misconceptions

The book in a nutshell

What I liked

What I didn't like

Conclusions

PostgreSQL indexing: The basics

The incredible impact of indexes

Indexing in theory

The B-tree

Selecting ranges is almost as fast as single values

Indexes greatly speed up ORDER BY

Indexing pitfalls

The query planner can choose not to use an index

An index is for an exact expression

Indexes optimize reads but slow down writes

Summary

Configuring touchpad tap in Sway

Configuring a libinput device in Sway

Summary

Syntax highlight anything with Tree-sitter

What's this Tree-sitter thing?

Working example: Markdown Simple

Getting started with creating Tree-sitter parsers

Baby's first grammar rule

Let there be color

Refining the grammar

Capturing inline code and code blocks

Capturing code blocks

Paragraphs with text and inline code

Resolving a conflict in the grammar

Improving the highlight queries

Injecting a JavaScript parser

Summary

Extending NeoVim for commenting and uncommenting code blocks

Commenting out code the hard way

Defining a command to comment out code

Adding support for range selection

Choosing line comment style by filetype

Uncommenting code

Summary and full code

Adding guardrails to psql for PostgreSQL

Configuring psql with .psqlrc

Making the default transaction read-only

Disabling autocommit

Summary

First impressions of Wayland on Arch Linux

What the heck is a window system?

Wayland first impressions with Sway

Upsides of Sway

Downsites of Sway

Conclusions

Dependabot's dependency grouping is awesome

Dependabot's big problem: Pull request spam

The fix: Grouped dependencies

A configuration example

Effects of the configuration

Ignoring certain dependencies in a group

Closing thoughts

A new dark theme for the blog!

Before and after

Parts of the puzzle

Enjoy!

What does the number in a man page mean?

man pages are divided into sections

Selecting man pages from different sections

And that's all!

The sheer insanity of interfaces and nil in Go

The sensible kind of nil

The completely insane kind of nil

The confusing type-and-value composition of interfaces

Actually, interfaces can also be "completely" nil

Structs are a problem

Footguns

Book Review: Writing an Interpreter in Go

The book in a nutshell

What I liked

What I didn't like

Conclusions

Fix 3D graphics in Arch Linux on Dell XPS 15 9520

Indexes greatly speed up `ORDER BY`

Configuring `psql` with `.psqlrc`

`man` pages are divided into sections

The sensible kind of `nil`

The completely insane kind of `nil`

Actually, interfaces can also be "completely" `nil`

Using `--lf` to rerun failed tests

Using the `-k` option to select tests by substring matching

Using the `-m` option to select by marker