Decoding Human Evolutionary Relationships: Cats vs. Dogs - A Comparative Genomics Guide
Overview
Have you ever wondered whether your family tree is more tangled with Fluffy or Fido? The question "Are humans more closely related to cats or dogs?" is a classic puzzle in evolutionary biology. The answer isn't as straightforward as checking your pet's ancestry—it depends on how you define "closely related." In this tutorial, you'll learn how scientists use genomic data and phylogenetic trees to compare evolutionary distances. By the end, you'll discover that while cats and dogs share a more recent common ancestor with each other than either does with humans, humans are roughly equally related to both. But don't take our word for it—you'll build a phylogenetic tree yourself to see the evidence.

Prerequisites
Required Knowledge
- Basic biology: Understanding that DNA carries genetic information and that species evolve from common ancestors.
- Familiarity with sequence data: Knowing what FASTA format looks like (a text file with header lines starting with '>' and sequences in uppercase letters).
- Command-line comfort (optional): For some tools, you may need to run commands in a terminal.
Software & Tools
- NCBI website (https://www.ncbi.nlm.nih.gov) – to download sequence data.
- Clustal Omega (online tool or standalone) – for multiple sequence alignment.
- MEGA X (free, cross‑platform) – to build and visualize phylogenetic trees.
- A text editor (e.g., Notepad++, VS Code) – to prepare FASTA files.
Step-by-Step Instructions
1. Decide on a Molecular Marker
To compare evolutionary relationships, we need a gene or genomic region that is present in all species and evolves at a manageable pace. A common choice is the cytochrome b gene (mitochondrial DNA) because it's conserved enough to align but variable enough to show divergence. For this guide, we'll use a small simulated dataset representing ~500 base pairs of cytochrome b from human (Homo sapiens), cat (Felis catus), and dog (Canis lupus familiaris).
2. Collect Sequence Data
Go to NCBI Nucleotide and search for cytochrome b human. Download the FASTA record for a reference sequence (e.g., NC_012920.1). Repeat for cat and dog: cytochrome b cat (e.g., NC_001700.1) and cytochrome b dog (e.g., NC_002008.4). Save each in a single text file, cytochrome_b.fasta, with headers like:
>human
ATGAC... [sequence]
>cat
ATGGG...
>dog
ATGAC...
(Note: In this tutorial, we'll use a publicly available pre‑aligned dataset from the UCSC genome browser to save time.)
3. Align the Sequences
Open the Clustal Omega web interface. Upload your FASTA file or paste the sequences. Click "Submit." After a few seconds, you'll get an alignment in both ClustalW and FASTA format. Download the aligned FASTA. This alignment ensures that homologous bases are compared—a critical step.
4. Build a Phylogenetic Tree
Launch MEGA X. Choose File → Open a File/Session and select your aligned FASTA. MEGA will prompt you to specify the data type (Nucleotide). Once loaded, go to Phylogeny → Construct/Test Maximum Likelihood Tree. Select the HKY+G model (a common DNA substitution model) and run with 500 bootstrap replicates. The tree will appear with branch lengths proportional to substitutions per site.

5. Interpret the Tree
Examine the tree. You will see a pattern: (human, (cat, dog)) if rooted with an outgroup. This means cat and dog share a more recent common ancestor than either does with human. The branch lengths show that the human–cat and human–dog distances are similar, confirming our earlier statement. To quantify, compute the sum of branch lengths from the common ancestor of all three to each tip—they will be nearly identical.
Common Mistakes
Mistake 1: Using Non‑Homologous Sequences
If you accidentally include a pseudogene or a nuclear copy of a mitochondrial gene, the alignment will be wrong. Always verify that your sequences are annotated as the same gene (e.g., MT‑CYB).
Mistake 2: Ignoring Bootstraps
A tree without bootstrap values is just a hypothesis. Bootstraps indicate statistical support. Values below 70% should be treated with caution. In our cat/dog/human tree, the cat‑dog clade should have >95% support.
Mistake 3: Misinterpreting Branch Lengths
Branch lengths represent evolutionary change, not time. Two lineages may have different rates (e.g., faster in rodents). For our question, we rely on the topology (shape) of the tree, not absolute distances, to decide which species are more closely related.
Mistake 4: Forgetting the Outgroup
An unrooted tree shows relationships but not direction. Always include an outgroup (e.g., a marsupial like opossum) to root the tree and polarize the characters. In our example, we rooted using a distant mammal sequence.
Summary
By building a phylogenetic tree using cytochrome b sequences, you can see that cats and dogs are more closely related to each other than either is to humans. In fact, humans are equally distant from both cats and dogs when you measure the total amount of evolutionary change along the branches. This tutorial showed you the complete workflow: from downloading sequences, aligning them, to constructing and interpreting a maximum likelihood tree. Now you can apply this method to any set of species and answer questions about evolutionary kinship yourself.