Article
0 comment

Pointing Readers in the Right Direction

Welcome back to this mini-series on the basic elements of an index.

I’m currently looking at what makes up an entry, which I described as “what this thing is + where to find it.” In my previous post I discussed main headings and subheadings, which form the first part of that equation. Today, I’m writing about the second part, “where to find it,” also known as locators.

Locators are the portion of the entry which tells readers where to find information about the main heading and subheading. They are like directions for the reader to follow. To be effective, locators need to be clear, specific, and accurate. 

There are three points that I want to make about locators.

The first is that a locator can be anything. Page numbers are usually the default locator, especially when indexing books, but other forms of locators can also be used, such as paragraph or policy numbers. The only criteria is that the locator is appropriate for the material being indexed.

My second point is that the locator needs to be clear and specific. Readers should understand how the locator relates to the text, and should be able to easily use the locator to find the desired information.

To give a few examples, page numbers are often augmented when referring to figures and tables or to footnotes and endnotes. For figures and tables, the page number may be placed in italics or bold, or have a fig. or t appended. For notes, the note number is usually appended to the page number, as in 253n43 or 265nn14-15. These allow the reader to more quickly pinpoint the information on the page.

For a cumulative index for a journal or multivolume series, the locator should probably include a volume or issue number alongside the page number, as in VII.343. For a policy document that I update every couple of years, I use unique policy numbers instead of page numbers, as in 20.2.1.2. These policy numbers both direct readers to the specific policy more quickly (especially if there are ten or more policies listed per page), and makes updating the index a whole lot easier as I don’t need to worry about the pagination shifting as policies are added, removed, and revised. 

When augmenting page numbers or using something different for a locator, it may be helpful to explain your choice in a headnote so readers understand how to interpret the locator. Page numbers, being the default, don’t need to be explained.

To give a few examples of locators:

conduction, 83, 84fig. 

convection, 85, 84t, 234n43

safety protocols, 2.1.13.4, 3.3.1.12, 3.4.3.6

thermodynamics, VI.343, IX.23

The last point I want to make is that the number of locators in an array should be reasonable for readers to search. As a general rule of thumb, no more than 6-10 undifferentiated locators. Larger arrays, with more than ten locators, should have subheadings to sort the references and make it easier for the reader to identify the relevant aspects of the subject. The concern is that searching through a long string of undifferentiated locators is time-consuming and may discourage the reader from finding what they need. Better to set the reader up for success by presenting locators in smaller chunks.

 

When writing an entry, the main heading and, possibly, subheadings, tell readers what the subject is, while the locator tells readers where to find information about that subject. Locators can be anything, and should be clear and specific. The number of locators should also be reasonable for readers to search. Do this, and you are setting your reader up for success.

Article
0 comment

Telling Readers What This Thing Is

In my last post I wrote about entries and arrays, which I described as the building blocks of an index. I defined an entry as “what this thing is + where to find it.” Today I’m going to expand on “what this thing is,” also known as main headings and subheadings.

The fundamental purpose of an index is to guide readers to the relevant information that the reader is searching for. To do that, the index needs to be clearly written, which begins with the first words that the reader sees. 

Main Headings

The main heading, also known simply as the heading, kicks off the entry. This is the first word or phrase that you see in an entry and array. It is typically a noun, and should be clear and concise. If a longer phrase is needed, the main heading should lead with the most important element. The main heading should also match how the term is used in the text, such as using the same spelling and capitalization. 

The heading should be reflective of both the text and the audience. Is the book discussing cars more generally or electric vehicles specifically? Or both? Will readers be reading this book to learn about the auto industry, new innovations, or specific brands and models? Or all of the above? To give another example, biblical characters such as Matthew, Silas, and Timothy probably don’t need a gloss clarifying their identity in a work of biblical studies to clarify their identities, but these names may be more unfamiliar to readers if they appear in other disciplines. 

Another consideration is whether or not to pluralize main headings. Should it be dog or dogs? Cantaloupe or cantaloupes? To start, be mindful of differences in nuance. Freedom is somewhat different from freedoms, for example. Otherwise, I tend to follow common usage. If a term is commonly pluralized, then I go ahead and make it plural in the index, which I think reads more naturally. 

To give a few examples of main headings:

Acts (biblical book)

Cleveland, Grover

electric vehicles

heat transfer

London (ON)

trade wars, retaliatory

Subheadings

For short arrays containing a handful of locators, a main heading is usually sufficient to specify what this thing is. But more specificity is often needed for topics with extensive discussion (usually when there are more than 6-10 locators) or if there are different aspects that readers would appreciate differentiated.

The subheading is placed after the main heading. Its purpose is to further clarify what this thing is. Because subheadings often differentiate references from one another, there are usually multiple subheadings per array.

Since the subheading is appended, there is more flexibility in how it can be phrased. Depending on the context, the subheading can be either a short word or phrase, or it can be longer and more descriptive.  In all cases, the relationship between the main heading and subheading should be clear. If possible, I try to also lead with the key word, which both affects how the subheadings are alphabetically sorted and, I think, makes it easier for readers to find the subheading as they scan the array. 

For example,

Acts (biblical book): authorship; within biblical canon; commentaries on; Paul within

Cleveland, Grover: first presidency; free-silver issue; legislative achievements; private life between presidential terms; second presidency

heat transfer: conduction; convection

Effective headings and subheadings connect readers to the text. Main headings are the point at which readers encounter the index, and readers should not need to guess what this thing is. The same is true for subheadings, if the reader decides to read further into the array. 

Tell the reader what they need to know. Be specific and concise. Do this, and your index will be well on its way to being excellent.

Article
0 comment

The Building Blocks of an Index

An index is a document that is scanned to find information. It usually spans several pages. But if you had to break an index down into its smallest part, what would that be?

An index is not like most books or documents in that it does not contain a narrative. It cannot be reduced to plot points or the components of an argument. An index doesn’t even contain proper sentences. Instead, an index is a compilation of references. Broken down, the smallest unit within an index is an entry. 

An entry has two components. Basically, “what this thing is + where to find it.” Using indexing terminology, this is “main heading + locator.” Or, to add another level of specificity, “main heading + subheading + locator.” From the entry, the reader can identify what they are looking at and where to find it in the text. For example,

Foxconn, 45

semiconductors: geopolitics of, 67

The second building block is an array. I like to think of an array as containing everything that an index—and by extension the book or document—has to say about a particular subject. If you want to learn about Foxconn, you search for the Foxconn array. Want to learn about semiconductors, you search for the semiconductors array.

If there is only one mention, then a single entry can serve as a single array. But more often, there are multiple discusses throughout the book, which lead to the creation of multiple entries. Combined together, the entries create an array.

Foxconn, 45, 49, 51-52

semiconductors: fabrication techniques, 54-57; geopolitics of, 67; history of, 23-25; properties of, 34, 44

Why are entries and arrays so important? No one writes an index composed of a single entry.

But every index begins with an entry, and as the index is written, the entries and arrays accumulate. It is through knitting the entries and arrays together than an index emerges.

Step one to writing an index is to write clear, concise, and specific entries, so that “what this thing is” is clear to the reader. Step two is to combine entries into arrays which are clearly organized and easy to scan. Step three is to sort and organize the arrays—creating the structure of the index—so that the index as a whole is easy to navigate. 

Each of these elements—the entry and the array—fit together, like interlocking pieces, to create a coherent whole.

A Note about Terminology 

I’ve noticed that not every indexer, including books about indexing, distinguishes between entries and arrays. I’m guilty myself of using the terms interchangeably, though I try to be clear when I’m writing.

But while terminology varies, I do think that the distinction is important. Because an index is composed of hundreds or thousands of pieces of information, it helps to know what these pieces are and how they interact with each other. An index is also easier to edit and organize if these building blocks are clearly written and well thought out.

As you index, how are these building blocks fitting together? How can you be more mindful of each piece of information and how it interacts with the entries and arrays around it? Does it make a difference to think about indexing as building up from the smallest unit to the larger whole?

Article
0 comment

Goodbye 2024, Hello 2025

Happy New Year! How is this first full week of 2025 shaping up for you?

I am having a slow start. I enjoyed unplugging from email and work for two weeks. I’m also recovering from a cold.

December was crazy busy, wrapping up nine indexes, which included a children’s book, one project which was simply names and biblical references, and two scholarly books that I began in November. So not quite as much work as nine indexes may sound, but still more than I bargained for. That was due to a couple of projects slipping and arriving when I did not expect and also, if I’m honest with myself, I did accept too much for December and didn’t leave enough margin for complications. 2024 was an odd year and I was trying to make up some missed income, but I should have been more disciplined and said no to a couple of projects.

2024 in Review

2024 was all about buying a house and moving. That dominated everything else in my life, including indexing. I’m thankful for this house. It still feels a bit surreal that I have a permanent place to live. I have always lived in anticipation of needing to move again within the next 1-4 years, and so a part of me is waiting to move again, another part is reassuring myself that I can stay where I am, and a third part doesn’t know how to compute living in a place for longer than four years. One day at a time, I guess.

Moving, settling, and learning the maintenance rhythms for this particular house has also taken time. There were a few months over the summer when I was indexing part-time at best. That impacted the number of projects I was able to complete, along with my income, so I am especially thankful for Elim, my wife, who is earning more than me. 

Looking at the numbers for 2024:

  • 44 indexing projects (down from 52 in 2023)
  • 25 trade books, 18 scholarly books, plus an index for a policy document that I update every couple of years
  • Worked with 12 publishers (30 indexes), 12 authors (1 index each), 1 professional association and 1 policy institute (1 indexes each)
  • 34 projects were from clients within Canada, 8 projects from clients in the US, and 2 projects from clients in Australia.

Overall, my numbers are fairly similar to 2023, despite fewer projects. My ratio of trade to scholarly books is similar, as is projects from publishers vs. authors, and where projects are coming from. I am very thankful for the publishers I work with and who continue to send me work year after year. Having these positive relationships and not needing to actively market as much definitely makes my life easier. 

One interesting point: one publisher sent me 11 books to index in 2024, which is 25% of my total projects. All of the other publishers sent me between 1-3 projects each. While I always enjoy working with that one publisher and I hope to continue to do so, I think this speaks to the value of working with a wide range of publishers, especially if they are fairly small companies. One or two publishers are not enough to fill my schedule, but 10+ combined keeps me fairly busy.

Indexing Highlights

Do you ever notice that certain topics can appear in waves, with two or three books in a row on a similar subject? I find that often happens to me and I wonder why. Very serendipitous. 

In the past few months I indexed two trade books on the Empress of Asia and the Empress of Ireland. These were both ocean liners owned by Canadian Pacific Steamships. The Empress of Ireland collided with another ship and sank in 1914 in the St. Lawrence River, causing the death of 1,012 people, Canada’s worst maritime disaster. The Empress of Asia had a longer career, serving in both WWI and WWII, before being sunk by the Japanese near Singapore. Reflecting their different trajectories, Beneath Dark Waters: The Legacy of the Empress of Ireland Shipwreck, by Eve Lazarus (Arsenal Pulp Press, 2025) is tightly focused on the collision and aftermath, while Oceans of Fate: Peace and Peril Aboard the Steamship Empress of Asia, by Dan Black (Dundurn Press, 2025) follows the Asia over its 30 year lifespan. Both focus on the lives of the passengers and crew, which is what brings these harrowing stories to life. I recommend both if you enjoy maritime history. 

Taking the prize for the most unexpected subject matter is Dialectics of the Big Bang and the Absolute Existence of the Multiverse, by Gregory Phipps (University of Alberta Press, 2024). Did you know that the first second after the Big Bang can be divided into six epochs? Difficult to study empirically, and so Phipps uses Hegelian dialectics to unpack what may have happened. A fascinating read, and also very challenging to index. Philosophy is not my strong suit. Thankfully the author seemed happy with the result. 

Whither 2025?

For 2025, my keyword is margin. I want time to write, to work on the house, to spend time with Elim, and for exercise and rest. This means managing my indexing schedule so that I’m not constantly chasing deadlines and needing to work long hours. I still have an income target, which is a decent amount but also about 10% less than what my target has been in the past. And I’m okay with that. The pace of work hasn’t felt healthy or sustainable, and I don’t like feeling like there are areas of my life I am missing out on.

So here’s to finding a better balance in 2025.

Article
0 comment

Making the Index Invisible

So the 18th edition of the Chicago Manual of Style dropped in September. I have to admit I have not bought a copy. While I think their recommendations are solid, I find I don’t use it very much, since I only index and not edit. But I do know some editors who are very excited about the new edition, and there has been chatter among indexers as well on the changes to the chapter on indexing.

The main change in regards to indexing is 15.66, which states:

Chicago now prefers the word-by-word system of alphabetization over the letter-by-letter system (but will accept either in a well-prepared index).

 

I think this change makes sense.

I personally most notice the difference in sorting when indexing Asian studies books, where I tend to see a lot of surnames like Chen, Kim, and Liu. Being so short, these names often get mixed up with other headings when sorted letter-by-letter, whereas I think the index is easier to scan if all of the surnames are sorted together. I’ve also received instructions from a scholarly press to sort the index letter-by-letter except for the names, which the press wants force-sorted word-by-word. Which begs the question: why not sort the entire index word-by-word?

For example, here is a comparison of letter-by-letter compared to word-by-word.

Letter-by-letter sorting
Liang Ji
Liang Qichao
Libailiu (Saturday)
Li Boyuan
Li Chen
Life Weekly
Lin Meijing
Li Shirui
List, Friedrich
Liu Denghan
Liu Jiang
Liu, Jianmei
Liu, Lydia
Liushou nüshi (Those Left Behind; film)
Liuxuesheng (overseas Chinese students)
Liu Yiqing
Li Yuanhong
 
Word-by-word sorting
Li Boyuan
Li Chen
Li Shirui
Li Yuanhong
Liang Ji
Liang Qichao
Libailiu (Saturday)
Life Weekly
Lin Meijing
List, Friedrich
Liu Denghan
Liu Jiang
Liu, Jianmei
Liu, Lydia
Liu Yiqing
Liushou nüshi (Those Left Behind; film)
Liuxuesheng (overseas Chinese students)
 

The word-by-word sorting, for me, is a lot easier to scan and parse when like surnames are grouped together, and when names are sorted together above other terms. It makes me confident that I am seeing all of the names present, rather than being concerned that I am missing a name that is buried below.

Also note that the Liu names are sorted according to the clarified 15.85, which states:

When the same family name is inverted for one person but not for another (e.g., “Li Jinghan” and “Li, Lillian”), the names may be listed together and alphabetized by first names regardless of the comma.

 

This also makes a lot of sense and has been my practice for a long time. By ignoring the comma, the second portion of the name is treated equally for all names, whereas if the comma is taken into account, all the names with commas sort to the top and may cause some names to appear out of order. For example,

Liu, Jianmei, 48
Liu, Lydia, 91
Liu Denghan, 148n6
Liu Jiang, 105
Liu Yiqing, 27, 144n13
 

For another interesting comparison, as a colleague pointed out, try looking for the sorting differences in the indexes between the 17th and 18th editions of the CMOS. And if you’d like to see a full list of the changes to the indexing chapter in CMOS 18,see here.

So will I now unilaterally switch to word-by-word sorting for all of my clients who request that the index follows CMOS? I don’t think so, unless I think that the index will really benefit. I think it is better if I first ask my clients if they want to change, so we are both on the same page and I am not springing a surprise on them. And, to be honest, for most indexes I don’t think that the difference between word-by-word and letter-by-letter sorting will be that noticeable.

This brings me to my larger point, which is that the mechanics of a well written index should be invisible to the user. I doubt that any reader will browse the index and think, “I wonder what the alphabetical sort is?” That is not the reader’s concern. What the reader cares about is quickly finding information.

To facilitate finding information, every aspect of the index needs to work together. This includes the sorting, the structure, term selection, phrasing, and cross-references. When it works, the reader shouldn’t notice how the index works because the reader is too busy digging into the book. When the index does not work—that is the point when the reader is pulled out of the index and is frustrated at their inability to access the information they want. The reader may not be able to articulate whythe index is not working, but something about the contents and mechanics of the index is wrong.

Bringing this back to sorting, for many indexes the difference will be negligible between letter-by-letter and word-by-word sorting. As CMOS states, they will accept either in a well-prepared index. For other books, like for me with Asian studies titles, the difference will be more pronounced.

When indexing, pay attention to when the difference matters. Make decisions based on what will make the user experience the most seamless. Pay attention to how the different elements of the index fit together. Striving to make the index invisible may be an odd way to think about indexing, but to be invisible means that the index works, which is what we ultimately want for our readers.

Article
0 comment

Paying Attention to Terminology

I am writing today about some decisions that I needed to make on a recent index. In the grand scheme of the index, these decisions only affected a few entries. I am tempted to brush these off as not very important and not worth discussing. Yet much of indexing is about paying attention to the details without getting lost in the details. And I think this is a unique situation that illustrates an important point about term selection. At least, it made me sit up and think carefully as I was working.

A good index encapsulates two different goals, which can sometimes seem like they are in opposition to each other. The index needs to be both a reflection of what the author has written and be an attempt to clearly communicate with the reader. Lose one of these aspects, and the index ceases to function.

Term selection is key to achieving both of these goals. The terms used in the index need to both match the text and how the reader is likely to search. Ideally the author and the reader are in alignment, but sometimes the author uses different language than what the reader might expect. In those situations, the index may need to bridge the gap.

I recently ran into this issue when writing the index for Saint Paul the Pharisee: Jewish Apostle to All Nations, by Father Stephen De Young (Ancient Faith Publishing, 2024).

If you are familiar with Christianity, the title may be a hint that the author is taking a different tact with terminology. While Paul was a pharisee prior to his conversion, he is now more commonly known as the Apostle Paul, or Paul the Apostle. Yet here Fr. Stephen is emphasizing Paul’s Jewishness.

In the book’s Introduction, Fr. Stephen addresses this question of terminology:

Throughout this book, I have deliberately eschewed certain language. This language is certainly acceptable and has become the usual language of the Church. However, familiar terminology can sometimes be misleading. By using the word Messiah instead of Christ, community instead of church, or Torah instead of law, I hope to unsettle commonly held notions and help the reader reassess Paul in his historical context, rather than project the experience of present-day Christians into the past.

 

This shift in terminology also extends to names, which is where I noticed the biggest difference in regards to the index.

In addition to “Paul the Pharisee,” Fr. Stephen also frequently refers to Paul by his former name, Saul of Tarsus. Jesus is referred to as “Jesus of Nazareth,” rather than Jesus Christ. A figure such as the Apostle John, also known as John the Evangelist, John the Theologian, or John the Divine, is here referred to as John, the son of Zebedee. None of these names are incorrect, but they are names that are less commonly used. They support the author’s focus on Paul and the early Church’s Jewish context and alerts readers that the author is taking a different approach.

From an indexing standpoint, do I follow Fr. Stephen’s lead? By using these names, I would provide continuity with the text and reinforce the point that Fr. Stephen is trying to make. But will readers still recognize these names in the index, outside of the context of the text? I am not helping anyone if I include names and terms that readers are unlikely to recognize.

In the end, I decided to lean into the author’s terminology. Christians form the primary audience for this book and, I assume, are familiar enough with with these Biblical figures, even if these are not the names typically used.

Paul I simply indexed as “Paul.” As the subject of the book, I decided a gloss was unnecessary. I also included a See cross-reference from Saul of Tarsus, for any readers looking under Saul and to keep all discussions of Saul/Paul in a single array.

I indexed Jesus as “Jesus of Nazareth,” with a subheading for “as Messiah,” to reflect how the author discusses Jesus. I indexed the other Biblical figures as is (“Peter,” “Silas,” “Timothy,’ etc…) except for when a gloss or tag was needed to disambiguate (for example, “James, brother of the Lord’ and “James, son of Zebedee”). This is again following the author’s approach and trusting that readers will recognize these names.

I did, however, include glosses for several of the provinces and cities discussed, such as “Achaia (province)” and “Perge (city),” especially the less well-known places (I didn’t include glosses for cities like Athens and Rome). This may not have been necessary, but I personally like knowing where things are and what things are, so as a reader I would have appreciated the differentiation.

As I wrote at the beginning, these names form a small proportion of the overall index. Was it really worth spending time considering how best to balance the author’s approach versus reader expectations? There are plenty of other discussions in the book, such as discussions about Paul’s missionary journeys, the history of the early Church, and theological issues that Paul addresses in his epistles, that I also wanted to get right.

And yet names matter and terminology matters. The index would have presented a different message if I had used more conventional names for these figures and the index would have appeared disjointed from the text. Writing a good index is often about paying attention to the details so that the entire index works together as a whole and in conjunction with both the text and readers. The trick is to see both the details and the whole. It can be easy to lose sight of the big picture.

For this book, while the author opted to shift the terminology to make a point, I decided that most readers would still be able to follow along in the index. I didn’t need to include much in the way of signposts and clarifications. But for other books, extensive use of cross-references and glosses may be necessary. While reflecting the text and the author’s intentions, the index also needs to be responsive to readers. Thankfully, we have tools to bridge that gap.

The first step, though, is paying attention to the language used by the author. The next step is considering the audience. Do the two match? From here you can select terms and write an index that is clear and recognizable to all.

Article
0 comment

My Index Editing Process

Last time I wrote about reading like an indexer and what it is I do and look for when reading a text and writing the rough draft of an index. Today I’d like to reflect on my editing process.

A few months ago I started tracking my time when I index. I had previously done so, but not effectively and I eventually gave up. This time, I’ve created a new system and a new spreadsheet that is much easier to use, and I am a lot happier with the results.

One of my insights so far is that I spend about an equal amount of time drafting and editing. I have to admit that this surprised me. I knew that editing took up a fair amount of time, but I didn’t realize that the time spent is often about 50/50. For some indexes, I actually spend a little more time editing, making the time split closer to 45/55 or even 40/60.

Reflecting further on my process, I tend to spread drafting the index over 3-6 days, depending on the length of the book. Whereas I tend to edit within 2-3 days. When drafting, I am learning what the book is about. When editing, I am fully immersed in the index and I treat it more like a sprint. It probably also helps that by the time I get to editing, the deadline is looming.

I’m realizing that I also tend to draft quickly. I do try to write a fairly clean draft, taking into account context, clarity, and relevance, as I previously discussed. I believe in trying to set myself up for an easier edit. But I also know that this is not my final draft and that some things won’t become clear until I’ve read the whole book, and so I also try to keep moving.

Editing an index, for me, is both seeing the index as a whole and going through the index line by line. I like to give myself space between drafting and editing, which usually means sleeping on the draft and beginning to edit the next day. This helps to give me some distance so I can more clearly see the whole index with fresh eyes.

I usually begin by skimming the index, making note of the larger arrays for the metatopic and supermain discussions. This reminds me of the structure I am aiming for, and is a chance to consider if I want to make any major changes. I then start at the top of the index and work my way down, line by line. I know some indexers edit using multiple passes, each pass looking at a different element. I think I would go utterly cross-eyed and unable to make sense of the index if I tried multiple passes. Instead, my goal is to fully edit the array in front of me before I move on to the next. This may mean jumping around the index to also edit related arrays, and sometimes I will go back to re-edit an array if I change my approach, but generally speaking, I systematically move through the index.

With each array, I am first of all looking for clarity. Does the main heading and any subheadings make sense? If there are subheadings, I look to see if any can be combined or reworded, or if subheadings need to be added for unruly locators. I consider if anything needs to be double posted, and check to make sure that is done properly. I consider and check cross-references. I investigate any notes I may have left for myself. I also spot-check a few locators to make sure I understood the text properly. I may also run a quick search of the PDF to see if I missed any references. I don’t check every locator, which I think would be very time-consuming—to a certain extent, I need to trust that my drafting process was thorough and accurate—but these spot checks do provide peace of mind and I do sometimes find errors.

Reviewing arrays with no subheadings is usually quick, unless I’ve left a note for myself or I decide to spot check. Arrays with subheadings take more time. If an arrays has 20+ subheadings, I may spend as much as twenty or more minutes making sure that the array is in order. I often find the larger the book, the larger the index, the more subheadings there will be, and the longer editing will take.

Considering my process, I do wonder if I can shave off time. I could spot check a little less, especially for simple arrays with no subheadings, trusting that I picked up what was necessary. I can also pay more attention, when drafting, to larger arrays, so that editing them isn’t so onerous. I could also explore using more macros and patterns for batching tasks such as double-posting or removing subheadings. What I like about my process, though, is that it is thorough and I can clearly see what is completed and what is still to come. Editing line by line helps to keep my thoughts in order.

Other Approaches to Editing

My approach to editing is not the only approach, of course. I’ve mentioned making multiple passes. I also know of indexers who do a quick edit at the end of each day, while drafting, so that the draft is cleaner. I’ve also heard indexers who say that they do such a thorough job drafting that the editing process only takes them a couple of hours. I don’t know how that works for them. I seem to need a lengthier editing process for the index to gel and come together. And that’s okay. We are all different. What matters is that you find a process that works for you.

I find it interesting to hear how others index, even if it is not something I would do myself. I hope this glimpse into my process gives you something to think about.

Article
0 comment

Reading Like an Indexer

So you are sitting down to write an index. You scroll to the first page in the PDF, or, if you’ve printed out the proofs, you place the first page on the desk in front of you, and then…what? What is your thought process? How do you decide what entries to extract? How do you read?

Reading to index is different than reading to edit, reading to learn, or reading for pleasure. I think of reading to index as a process of disassembly. I try to identify how the author has written and structured the text, and I then pull apart all of those pieces, big or small, and reassemble them into the form of an index. This is very much an active reading, in which I am identifying, analyzing, and making decisions. 

I generally look for two types of information when I draft an index.

  • Specific details. These are names, places, companies, concepts, etc… that are explicitly mentioned and discussed. These are usually fairly obvious. If there are a lot of names or other such details, I may index a few pages, pick up these details, and then go back and re-read to make sure I also understand the larger discussion.
  • Broader topics. These range from the metatopic—what the whole book is about—to supermain and regular discussions—both themes spanning the book and what specific chapters or sections are about. It is important to have index entries which correspond to these broader discussions, and so in addition to picking up specific details, I try to also understand the big picture. These broader topics are also tied to the structure of the index, as I consider how best to reflect the book’s structure in the index, and as I anticipate that these large discussions will become large arrays, anchoring the index. Depending on the book, as mentioned, I may need to read a section two or more times to properly mine all relevant entries. 

Once I have identified the large and small pieces that the book is made of, I need to decide how to translate that into the index. Here are a few tips I find helpful to keep in mind.

  • Understand what you are reading. This may seem obvious, but I think it is worth stating. The temptation, at least for me, is to guess if I am unsure and to create an entry anyway. And sometimes guessing is the best I can do in that moment. I flag the entry for revisiting later and I move on. What can be more effective, though, is to read ahead a few pages until I do understand, and then go back and create the entry. It’s okay to be patient. Taking the time to understand can pay off later with better understanding of what comes next in the text and with less editing due to a stronger draft. 
  • Place the information in context. Are you looking at a specific detail or a broader topic? How does the detail or topic relate to other details or topics? Can this be turned into a subheading? Should it be double-posted? Is a cross-reference necessary? What other entries does this suggest? While subheadings, cross-references, and double-posts can all be revisited later, when editing the index, I like to start thinking about them while writing the rough draft. The information in the book is an interconnected web, which the index should reflect. So as part of your thought process, get in the habit of looking for these connections. 
  • Filter for relevance. In addition to understanding the larger context, also pay attention to relevance. Think about the audience before you begin writing the index. Consider how much space is available for the index. What should the index focus on? Sometimes I am not sure if an entry is relevant and so I pick it up anyway, labeling it for possible deletion later. But the more I can filter out now, the less I need to cut later. 
  • Communicate with clarity. This is especially true for subheadings. Make sure that readers understand what this entry means. Be concrete and, where relevant, link back to the larger context. You don’t want to leave readers guessing, nor do you want to leave yourself guessing when you come around again to edit.

All combined, this is a lot to do while reading and indexing. It can be difficult to identify both specific details and larger discussions, while also weighing relevance, and paying attention to the context, and thinking about related entries, and thinking about how best to phrase for clarity. Reading to index is a skill that takes practice.

Remember too that the rough draft does not need to be perfect. My drafts are certainly not perfect, and while I am thinking about all of this while drafting, I spend about an equal amount of time editing. 

How you read is up to you. I tend to start reading and I type entries into Cindex, the indexing software that I use, as the entries come to mind. Other indexers prefer to first mark up the proofs, identifying what is indexable and making notes for themselves, before they go back and type up the entries. There is no right or wrong approach, so long as you are paying attention to all aspects of the text, both big and small.

If you are newer to indexing, you may find marking up the proofs to be a good way to visualize or make concrete this thought process. I marked up proofs the first 3-4 years that I indexed, which in hindsight was necessary for me to engrain this way of reading. Once indexing started to become habit, I stopped marking up, though I still read ahead sometimes to better understand what the text is about. 

Writing an index is a unique way to interact with the text. It does require a shift in how you read and see the text. Once you make that shift, indexing becomes easier. 

Article
0 comment

When Subheadings Are Not So Useful

I love subheadings. They add so much to an index, breaking down long strings of locators into smaller chunks, highlighting meaning distinctions, and gathering related entries into lists so readers only need to search in one place. As I discuss in my last reflection, subheadings can also reflect the story that the text is telling. Well-written subheadings are clear, specific, and meaningful.

But…in indexing there is always a but. Occasionally, a project comes along that proves the exception. 

This happened with a recent index I wrote, for To See What He Saw: J.E.H. MacDonald and the O’Hara Years, 1924-1932, by Stanley Munn and Patricia Cucman (Figure 1 Publishing, 2024). J.E.H. MacDonald was a Canadian painter and a member of the Group of Seven. He fell in love with the landscape around Lake O’Hara, in the Rocky Mountains, and spent several summers there painting. This book takes an interesting approach to MacDonald. Over the course of almost twenty years, the authors sought to identify the exact locations where MacDonald painted. The bulk of the book is composed of a brief discussion of each of the O’Hara paintings, alongside a photograph of what the scene looks like today. The rest of the book is composed of an introduction, an overview of each of MacDonald’s eight trips, and excerpts from MacDonald’s diaries and other writings. The result is a beautifully illustrated coffee-table book. 

The instructions from the press were to only index the paintings, people, and places. While narrow in scope, there isn’t too much else discussed, and these are what readers are most likely to want to find, so I thought the instructions reasonable. Figure 1 Publishing is also very good at providing clear specifications for how long the index can be. For this book, the specs were 55-60 characters per line, for 675 lines total. 

I quickly realized that the book mentions a lot of paintings and places. The book discusses 226 paintings, almost all of them by MacDonald. With each painting taking up at least a line, some of them more, the paintings alone fill up about a third of the index. The rest of the index is mostly places—mountains, lakes, creeks, trails, huts—in and around Lake O’Hara that MacDonald either painted or visited. In comparison, only a few people are mentioned.

I also realized that the book contains a lot of repetition. For example, the same mountain may appear in a couple dozen different paintings. That mountain is mentioned again in the overviews of MacDonald’s trips, and then again in MacDonald’s diaries. This kind of repetition makes sense given how the book describes the same events and paintings from different angles, but it does mean that the mentions add up. Arrays with especially long strings of locators include Cathedral Mountain (49 page references), Hungabee Mountain (39 references), Odaray Bench (34 references), Lake McArthur (32 references), and Lake Oesa (27 references).

Normally, I would add subheadings to these arrays. Asking readers to look up each page reference is a big ask. But for this index, I left those strings, for paintings and places, intact. 

Not using subheadings was a conscious decision, and one I didn’t make lightly. My initial instinct was to find subheadings. But as I indexed and considered the entries, I also realized that subheadings would not be so useful in this particular index. Wanting a second opinion and to avoid surprising the press with a departure from my usual approach, I also queried the editor I was working with and got their approval.

I decided to not use subheadings for two reasons. One, I realized that too many subheadings would quickly make the index too long. Unfortunately, space constraints can sometimes mean putting aside the index that you want to write for the index that fits. In these situations, I need to be strategic about picking and choosing the subheadings that will have the biggest impact, while also being okay with other arrays not having subheadings. 

More importantly, though, for this book, I couldn’t think of subheadings that I was satisfied with. For subheadings to be effective, they need to clearly articulate additional information that readers can use to narrow their search. But what if there are no clear distinctions between locators? In that case, I think the long strings of locators should be left alone. It is not helpful to introduce artificial distinctions or to get so granular that context is lost. 

As I mentioned, this book contains a lot of repetition. Places either appear in MacDonald’s paintings, are places that MacDonald visited, or both. This doesn’t provide much to hang a wide range of subheadings. 

I briefly considered listings all of the paintings that each mountain or other feature appears in, along with a subheading for MacDonald’s presence at. For example, 

Cathedral Mountain: MacDonald at; in painting 1; in painting 2; in painting 3; in painting 4; in painting 5; etc…

But this approach presents a few problems. Some arrays would have been enormous, with a dozen or two subheadings for each of the paintings. Besides the space issue, I’m not convinced that listing each painting would have been meaningful to readers. Would readers remember the titles of individual paintings? In many cases, multiple paintings shared the same title. Thankfully, the authors give each painting a unique alphanumeric code, which I included in the index to differentiate. For example, “Lake O’Hara (25-1.3(S))” and “Lake O’Hara (30-3.1).” But I imagine it would still be difficult remembering which is which. Alternatively, I could have created a subheading for “in paintings,” but that would have still resulted in a long string of locators, as would the subheading “MacDonald at.” “MacDonald at” also isn’t very useful since readers can presumably assume that MacDonald was there, as that is the focus of the book. 

Given the space constraint and that either way—with a couple of generic subheadings or without subheadings—the arrays would have long strings of locators, I decided it was best to keep the arrays simple and to forego subheadings. This does mean that readers will need to search through each locator, though readers should also quickly notice the repetition, and it is all there for the dedicated searcher. 

This isn’t to say that I avoided subheadings entirely. I did use them in a few places, mostly for people, though even with people I found it difficult to avoid longer strings of locators. Many of these references are brief mentions and again reflect the repetition throughout the book. For example, here are two arrays for MacDonald’s friend, George Link, and wife, Joan.

Link, George K.K.
     about, 234, 343n83
     Lake O’Hara Trails Club and, 340n25
     MacDonald and, 82, 91, 104, 107, 113, 114, 143, 215, 224, 233, 234, 239, 243, 249, 252, 253, 254, 256, 258, 264, 294, 301, 307, 308, 310
     photographs, 246, 260

MacDonald, Joan
     encouragement from to travel west, 13, 202, 205
     letters to, 93, 96, 115, 120, 121, 122, 131, 167, 175, 200, 203, 204, 205–6, 211–12, 229, 230–31, 236, 240, 256, 265
     Links and, 341n47
     MacDonald’s departure west and, 250
     mentions in MacDonald’s diary, 304, 308
     O’Hara trip with MacDonald, 36, 123, 191, 217, 221, 222, 223–24, 259
     photo album, 224

While I highly encourage you to include subheadings and to make sure that subheadings are clear, specific, and meaningful, I think it is also worthwhile considering the exceptions to the rule. I hope that my approach to the index for To See What He Saw, about J.E.H. MacDonald’s paintings in and around Lake O’Hara, is helpful for considering when subheadings may not be useful. If there is a lot of repetition in the text, if it is difficult to find meaningful distinctions, and if there is a hard space constraint, then it is okay to have long strings of undifferentiated locators. It is not ideal, but it may still be the best solution for that particular text and index.

Article
0 comment

Indexing as Storytelling

What does the process of indexing consist of?

Is it primarily a process of extracting terms from the text? I’ve noticed, when talking to readers and editors, that this seems to be how many people conceptualize writing an index. It is less writing and more data mining. 

I want to suggest an alternative approach. While identifying and picking up key words is important, I don’t think that that is enough. A excellent index should also contain an element of storytelling. 

I also want to address a mistake that I’ve seen newer indexers make. It is being so focused on the myriad details of the text that they—and the index—lose sight of the larger context. I’ve noticed this especially in subheadings. I don’t think that this is an intentional mistake. Books contain such a tsunami of information that it can be difficult to know where to focus. This is why I prefer to spread the work out over several days, so my mind has time to process and absorb what I’ve read. If you struggle to keep the big picture in mind while indexing, I hope this reflection gives you some pointers.

As a caveat, while writing this I have in mind narrative-driven books. Think histories and biographies. These are books that typically contain a lot of detail while also telling a story. Other sorts of books, such as in the social sciences, how-to guides, or law, are more technical in nature and may not have an overt narrative (though every book should have some sort of structure). But even for more technical books, it is important to keep the larger context in mind. 

Context, Context, Context

In my book, Book Indexing: A Step-by-Step Guide, I discuss what I call the hierarchy of information. At the top is the metatopic, which is what the entire book is about. Below that are the supermain discussions, which are the handful of major arguments or areas of focus that comprise the metatopic. Below the supermain discussions are the regular discussions, which are the sub-discussions which flesh out the supermains. At the very bottom are all the little details, typically names, places, events, etc… All of these layers are nested together and should be reflected in the index.

When I refer to the context or the big picture of a book, I am talking about the metatopic, supermain, and regular discussions. These are the overarching discussions that give meaning to the smaller details. Problems can arise when the indexer fails to link back to the context, leaving readers confused about the meaning or relevance of an array or subheading.

Audience as Context

Before I jump into some examples at the subheading level, which is where I usually see this issue, I also want to mention that the book’s audience is also an important context. Not every detail is indexable. Before starting the index, think about what is important to the readers.

I’ve seen it happen when the indexer is so focused on picking up the details that they forget to assess whether the details are relevant. For example (and this is a made-up example, as I don’t want to embarrass anyone), say the book is a memoir on hiking Mount Everest. The author also briefly discusses, over a few paragraphs, a previous trip hiking Mount Kilimanjaro. Since it is discussed, Mount Kilimanjaro should have a main entry, but because the overall focus of the book—and presumably of readers—is on Mount Everest, the indexer does not need to pick up specific details about people and places associated with Mount Kilimanjaro. Those details are not relevant and will bloat the index. Instead, focus the index on Mount Everest. 

Using Subheadings to Tell a Story

Now let’s discuss subheadings.

Subheadings are crucial for breaking down large arrays into searchable chunks, but they are only effective if they are clearly written. Subheadings which are too granular and disconnected from their context are not helpful.

Consider this example. This is also made-up and is similar to real arrays that I have seen.  

Obama, Barack: communications with; congressional leaders and; economy and; Iraq and; oil and; as president; Senate and

How much do you understand about Obama from these subheadings? Do you have a clear sense of what you will find if you let these subheadings direct you?

To start, who is Obama communicating with? About what? It could be about anything. The subheadings “congressional leaders and’ and “Senate and” are a little more specific. If the text itself is vague or if these subheadings cover multiple interactions, then this level of vagueness may be appropriate. But what if we learn that these three subheadings are all referring to negotiations over the Affordable Care Act? Now we have context.

“As president” is also an unhelpful subheading, since most readers should know that Obama was president. Does it help to learn that the context is being elected during the 2008 presidential election? The other subheadings, for the economy, Iraq, and oil, may be clear enough, though it again depends on what the text is actually about. 

While all of these subheadings are technically correct—Obama is indeed communicating with someone, he is president, and he is doing something in regards to the economy—these subheadings also feel disconnected from anything concrete, at least to me. If we revise these subheadings to more accurately reflect the larger discussions, we get the following array. Which seems more connected to his presidency? Which is more helpful to readers?

Obama, Barack: 2008 presidential election; Affordable Care Act; economic policies; Keystone XL pipeline; withdrawal from Iraq

Let’s look at another example. Sometimes subheadings within an array are treated as a list, as in a list of names. 

Microsoft: Allen; Ballmer; Gates; Nadella; Wallace

These are all key players in Microsoft’s history. They are important and should all have main entries, but is listing them as subheadings really the best use of the Microsoft array? It doesn’t tell us much except that these people all have links to Microsoft. Why not use the subheadings to instead tell Microsoft’s story? Gates, Ballmer, and the others can still be in the index; just not the focus here. 

Rewriting the Book in the Index?

At this point, I can imagine a couple of objections. Is storytelling really appropriate within an index?

A common rule of thumb is to not rewrite the book in the index. I understand the point, that the index is supposed to direct readers to where the discussion actually is. But the index can only direct if the entries are clearly written. One of the best ways to be clear, in my opinion, is to connect to the larger context. I enjoy stories, and so I like to think of this as storytelling. If it helps, think of this as being clear and specific. What will resonate with the reader? Use that to hook the reader and send them in the right direction. 

It is also important to select the level of specificity that matches the discussion in the text. Returning to the Obama example, the Affordable Care Act is much more specific, and therefore more meaningful, than a generic subheading for healthcare policies. But if the discussion in the book is more like a broad overview, as in an overview of various economic policies, then a subheading at a broader level, like “economic policies,” would be the better choice.

Storytelling vs. Lists

I am also not saying that you should never make a list. Using subheadings to gather information into a list is also a valid approach. The two approaches can even be used in the same index. For example, for a book about Margaret Atwood, the main array for Atwood could tell the story of her life and career while a separate array—perhaps appended using the em-dash-modified format—could list all mentions of her novels and other writings. Books that are more technical in nature, rather than narrative-driven, may also favor lists over storytelling. The trick is knowing when each strategy is appropriate.

I also think that storytelling—making sure that the big picture is adequately represented in the index—can be more difficult to do, or at least more difficult to remember. It is summarizing and pointing towards the narrative and structure that exists within the book. Gathering together a list is often easier. 

When indexing, remember that you have options for how to present entries and information to the reader, and that your goal is to clearly communicate what the book is about.

Taking a Step Back

So how do you see the big picture? How do you channel that wave of information that is threatening to overwhelm you?

I find it helps to pause and take a step back. I especially do this if I feel like I’ve lost sight of the author’s argument or point. Or if there are a lot of names and other details and so it is easier to make two passes over that section, once for the details and a second time to see the full picture. I ask myself, “What is this discussion about, anyway?” Once I’ve identified the overarching discussion, I may need to go back and create entries for the context that I’ve missed.

If you struggle to see the big picture or the hierarchy of information, try to develop a habit of pausing and reflecting. Read until you hit a transition. Pause. How would you summarize the discussion you just read? How does the discussion fit into the larger structure or narrative of the book? Try completing the following sentence: “This section is about…” Be clear, specific, and meaningful. Once you have your answer, put it in the index.