Chapter 02

Defining, Describing, and Visualizing Sequences

TOC and content (click to jump to examples)

2.1 Basic Concepts and Terminology
  Sequence data formats
2.2 Defining Sequences
  Sequence length and granularity
2.3 Description of Sequence Data I
  Basic Description
2.4 Visualization of Sequence Data
  Color palette: Grayscale Edition
  Color palette: Colored Edition
  2.4.1 Data summarization graphs
  2.4.2 Data representation graphs
  ggplotify sequence data plots (bonus material)
2.5 Description of Sequence Data II
  Unidimensional and composite indices

Chapter 2 introduces the basic concepts and discusses how to define sequence data. The corresponding material on this site shows how {TraMineR} processes different data formats. We further illustrate how the alphabet and the granularity of sequences can be changed with {TraMineR}’s built-in helper functions.
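
To make the two operations concrete, here is a minimal plain-Python sketch of what changing the alphabet and the granularity of a sequence means. It is not {TraMineR} code and does not reproduce its implementation: the function names, the state codes (S, M, D), and the monthly example sequence are hypothetical and chosen only for illustration.

```python
from collections import Counter


def recode_alphabet(sequence, mapping):
    """Replace each state by its recoded value; states not in `mapping` are kept."""
    return [mapping.get(state, state) for state in sequence]


def change_granularity(sequence, span, method="first"):
    """Aggregate every `span` consecutive positions into a single position."""
    chunks = [sequence[i:i + span] for i in range(0, len(sequence), span)]
    if method == "first":
        return [chunk[0] for chunk in chunks]
    if method == "last":
        return [chunk[-1] for chunk in chunks]
    if method == "mostfreq":
        # Ties are resolved in favor of the state that appears first in the chunk.
        return [Counter(chunk).most_common(1)[0][0] for chunk in chunks]
    raise ValueError(f"unknown method: {method}")


# Hypothetical monthly sequence over one year:
# S = single, M = married, D = divorced (illustrative codes only).
monthly = ["S"] * 5 + ["M"] * 6 + ["D"]

# Monthly -> quarterly: keep the last state observed in each quarter.
quarterly = change_granularity(monthly, span=3, method="last")

# Smaller alphabet: collapse S and D into U (unpartnered), M into P (partnered).
collapsed = recode_alphabet(monthly, {"S": "U", "D": "U", "M": "P"})
```

The choice of aggregation rule matters: keeping the last state of each quarter preserves the short divorce spell at the end of the year, whereas keeping the first or the most frequent state drops it.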

Because visualization is one of the key features of sequence analysis, we provide extensive material on improving the quality of figures by choosing appropriate color palettes. Although we highly recommend using color for visualizing sequences, the associated printing costs are often prohibitively high (as was the case for our book). We therefore also provide material illustrating how to produce acceptable results with a grayscale palette and added texture.

We further show useful commands for a comprehensive description of the sequence data. Finally, we turn to more advanced composite measures of sequence complexity. These indices are interesting in themselves and can also be used as either independent or dependent variables in subsequent regression analyses.
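
As a rough illustration of how a composite index is built from unidimensional ones, the following plain-Python sketch combines the number of transitions with the normalized within-sequence entropy into a complexity score, using the commonly cited form (the geometric mean of the two normalized components). It is a sketch under stated assumptions, not {TraMineR}'s implementation: the function names are hypothetical, and each sequence is assumed to have at least two positions and an alphabet of at least two states.

```python
import math
from collections import Counter


def n_transitions(sequence):
    """Count positions where the state differs from the preceding one."""
    return sum(1 for a, b in zip(sequence, sequence[1:]) if a != b)


def normalized_entropy(sequence, alphabet):
    """Shannon entropy of the state distribution, divided by its maximum.

    The maximum, log(len(alphabet)), is reached when every state of the
    alphabet occupies the same share of the sequence.
    """
    n = len(sequence)
    shares = [count / n for count in Counter(sequence).values()]
    entropy = sum(p * math.log(1 / p) for p in shares)
    return entropy / math.log(len(alphabet))


def complexity(sequence, alphabet):
    """Geometric mean of the transition share and the normalized entropy."""
    transition_share = n_transitions(sequence) / (len(sequence) - 1)
    return math.sqrt(transition_share * normalized_entropy(sequence, alphabet))


# Hypothetical sequences over the alphabet {A, B}.
alphabet = ["A", "B"]
stable = list("AAAA")       # no transitions, one state only
one_change = list("AABB")   # one transition, both states equally present
alternating = list("ABAB")  # a transition at every step

scores = {
    "stable": complexity(stable, alphabet),
    "one_change": complexity(one_change, alphabet),
    "alternating": complexity(alternating, alphabet),
}
```

Each sequence yields a single number, so scores of this kind can be attached to the person-level data and entered into a regression like any other variable.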

Corrections

If you see mistakes or want to suggest changes, please create an issue on the source repository.

Reuse

Text and figures are licensed under Creative Commons Attribution-NonCommercial 4.0 (CC BY-NC 4.0). Source code is available at https://github.com/sa-book/sa-book.github.io, unless otherwise noted. Figures that have been reused from other sources do not fall under this license and can be recognized by a note in their caption: "Figure from ...".