OverTheWire Bandit Level 9: Finding Unique Lines with sort and uniq
Level 9 introduces text processing, one of the most powerful skills in Linux. When you're dealing with a file containing many duplicate lines and only one unique line, you need tools to filter and analyze it. That's where sort and uniq come in.
Level 9 teaches you:
- Using `sort` to organize lines in a file
- Using `uniq` to find unique or duplicate lines
- Combining commands with pipes (`|`)
- Using `grep` to filter results
- Processing text data efficiently
This level is where you start seeing the real power of Linux: combining simple commands to solve complex problems. This skill is essential for log analysis, data processing, and security assessments.
The Objective
After logging into bandit8, your goal is to find the password for Level 9. The password is in a file called data.txt in your home directory. The file contains many lines, and almost all of them are repeated multiple times. Only one line appears exactly once; that's your password.
What Level 9 teaches:
- Using `sort` to organize file contents
- Using `uniq` to find unique lines
- Understanding how pipes connect commands
- Using `grep` to filter output
- Processing large text files efficiently
The challenge: The file has thousands of lines, and most are duplicates. You need to find the one line that appears only once. Manual searching would be impossible; that's why sort and uniq exist.
Understanding the Problem
Let's start by connecting to Level 8 and seeing what we're dealing with:
sshpass -p `cat bandit8` ssh bandit8@bandit.labs.overthewire.org -p 2220
Once connected, let's check the data.txt file:
ls -la data.txt
cat data.txt | head -20
You should see many lines, and if you look closely, you'll notice many duplicates. The file might have thousands of lines, but only one is unique.
The problem: How do you quickly find the one line that appears only once among thousands of duplicate lines?
The answer: Use sort to group duplicates together, then uniq to find unique lines, then filter with grep to get the one that appears once.
Understanding sort and uniq
Let's dive deeper into these commands, because they're incredibly useful:
The sort Command
The sort command organizes lines in a file alphabetically (or numerically):
sort filename
What it does:
- Reads all lines from a file
- Sorts them alphabetically
- Outputs the sorted lines
Why this matters: When lines are sorted, duplicates become adjacent (next to each other). This is crucial for uniq to work properly.
Example:
# Before sorting:
apple
banana
apple
cherry
banana
# After sorting:
apple
apple
banana
banana
cherry
Notice how duplicates are now next to each other.
The uniq Command
The uniq command filters out duplicate adjacent lines:
uniq filename
Important: uniq only works on adjacent duplicates. If duplicates aren't next to each other, uniq won't catch them. That's why you need sort first.
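You can see this adjacency requirement with a quick experiment (the fruit names here are just sample data):

```shell
# Without sorting, the second "apple" is not adjacent to the first,
# so uniq leaves it in place:
printf 'apple\nbanana\napple\n' | uniq
# apple
# banana
# apple

# Sorting first groups the duplicates together, so uniq can collapse them:
printf 'apple\nbanana\napple\n' | sort | uniq
# apple
# banana
```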
Common uniq options:
- `uniq` - Removes adjacent duplicates (keeps one copy)
- `uniq -u` - Shows only unique lines (lines that appear once)
- `uniq -d` - Shows only duplicate lines
- `uniq -c` - Shows the count of each line
Example:
# Input (must be sorted first):
apple
apple
banana
banana
cherry
# uniq (removes duplicates):
apple
banana
cherry
# uniq -c (shows count):
2 apple
2 banana
1 cherry
Combining Commands with Pipes
The pipe (|) connects commands together:
command1 | command2
What it does:
- Takes the output of `command1`
- Feeds it as input to `command2`
- Allows you to chain multiple commands
For Level 9:
cat data.txt | sort | uniq -c | grep -v "10"
This:
- Sorts `data.txt` (groups duplicates together)
- Pipes to `uniq -c` (shows the count of each line)
- Pipes to `grep -v "10"` (filters out lines containing "10", leaving only the line with a count of 1)
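To see what each stage of this pipeline does before running it on the real file, here is a toy reproduction on a tiny hand-made file (one line repeated three times, one line appearing once; file and line names are invented for the demo):

```shell
# Build a toy file: one line repeated three times, one line appearing once.
printf 'repeated\nrepeated\nunique_value\nrepeated\n' > demo.txt

# sort groups the duplicates; uniq -c prefixes each line with its count:
sort demo.txt | uniq -c
# prints (counts left-padded with spaces):  3 repeated  /  1 unique_value

# Filter out the majority count (3 here, 10 in the real level):
sort demo.txt | uniq -c | grep -v "3"
# prints only:  1 unique_value

rm demo.txt
```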
Step-by-Step Walkthrough
Step 1: Connect to Level 8
sshpass -p `cat bandit8` ssh bandit8@bandit.labs.overthewire.org -p 2220
Step 2: Check the File
Let's see what we're working with:
ls -la data.txt
wc -l data.txt
The wc -l command counts lines. You'll see the file has many lines (possibly thousands).
Step 3: Sort and Find Unique Lines
Use sort, uniq, and grep to find the unique line:
cat data.txt | sort | uniq -c | grep -v "10"
Breaking this down:
- `cat data.txt` - Reads the file contents
- `| sort` - Sorts all lines alphabetically, grouping duplicates together
- `| uniq -c` - Shows the count of each line (how many times it appears)
- `| grep -v "10"` - Filters out lines containing "10" (the `-v` means "invert", so it shows lines that DON'T contain "10")
What you'll see: Most lines will show a count of 10 (they appear 10 times). One line will show a count of 1 (it appears only once). The grep -v "10" filters out all the lines with count 10, leaving only the line with count 1.
Output: You should see one line that looks like:
1 UsvVyFSfZZWbi6wgC7dAFyFuR6jQQUhR
The number 1 at the beginning indicates this line appears only once. The string after it is your password.
Step 4: Extract the Password
The password is the string that appears after the count of 1. Copy that stringโthat's your password for Level 9.
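If you want the shell to print just the password string with no count column, you can strip it with awk. This is a sketch assuming the standard two-column `uniq -c` output (count, then the line itself):

```shell
# Select the line whose count field ($1) is exactly 1,
# then print only the second field (the password string itself).
sort data.txt | uniq -c | awk '$1 == 1 {print $2}'
```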
Step 5: Save the Password
Copy the password and save it:
On Linux/macOS:
echo "PASSWORD_HERE" > bandit9
On Windows (PowerShell):
"PASSWORD_HERE" | Out-File -FilePath bandit9 -NoNewline
Step 6: Connect to Level 9
sshpass -p `cat bandit9` ssh bandit9@bandit.labs.overthewire.org -p 2220
Understanding the Solution
Let's break down why this command works:
Why Sort First?
Without sorting, duplicates are scattered throughout the file:
apple
banana
apple
cherry
banana
uniq only removes adjacent duplicates, so it won't catch the duplicate "apple" and "banana" because they're not next to each other.
After sorting, duplicates are adjacent:
apple
apple
banana
banana
cherry
Now uniq can properly identify duplicates.
Why uniq -c?
The -c flag shows the count of each line:
2 apple
2 banana
1 cherry
This tells us:
- "apple" appears 2 times
- "banana" appears 2 times
- "cherry" appears 1 time (unique!)
Why grep -v "10"?
In Level 9, most lines appear 10 times. The output would look like:
10 password1
10 password2
10 password3
1 unique_password
The grep -v "10" filters out all lines containing "10", leaving only the line with count 1:
1 unique_password
The -v flag: Inverts the match; it shows lines that DON'T match the pattern.
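A minimal illustration of `-v` (the sample words are invented for the demo):

```shell
# grep -v keeps only the lines that do NOT contain the pattern.
# Note it matches substrings: "catfish" is dropped too, because it contains "cat".
printf 'cat\ndog\ncatfish\n' | grep -v "cat"
# dog
```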
Alternative Methods
Here are different ways to find the unique line:
Method 1: sort | uniq -c | grep -v "10" (Recommended)
cat data.txt | sort | uniq -c | grep -v "10"
Pros: Fast, efficient, standard approach. Cons: Not bulletproof; if the unique password itself happened to contain "10", this filter would discard it too.
Method 2: Using sort | uniq -u
sort data.txt | uniq -u
Pros: Simpler; directly shows unique lines. Cons: Might show multiple lines if there are several unique ones (unlikely for Level 9).
Note: uniq -u shows lines that appear exactly once, but it requires the file to be sorted first.
Method 3: Using sort | uniq -c | grep "^\s*1\s"
cat data.txt | sort | uniq -c | grep "^\s*1\s"
Pros: More explicit pattern matching. Cons: More complex regex pattern.
What this does:
- `^\s*1\s` - Matches lines starting with optional whitespace, then "1", then whitespace
- More precise than `grep -v "10"`, but more complex
Method 4: Manual Search (Not Recommended)
cat data.txt
# Manually scan through thousands of lines...
Pros: Simple, no new commands. Cons: Extremely slow, error-prone, nearly impossible.
For Level 9, use Method 1; it's the most efficient and teaches you valuable skills.
Real-World Context
Why does this matter in penetration testing?
In real security assessments, you'll constantly need to process and analyze text data:
1. Log File Analysis
Log files often contain:
- Repeated entries (normal activity)
- Unique entries (anomalies, attacks)
- Error messages
- Access attempts
Example: Finding IP addresses that appear only once in an access log:
cat access.log | awk '{print $1}' | sort | uniq -u
2. Finding Unique Errors
When analyzing logs, unique errors might indicate:
- New attack patterns
- System issues
- Unusual activity
Example: Finding unique error messages:
cat error.log | grep "ERROR" | sort | uniq -u
3. Password List Processing
When working with password lists:
- Removing duplicates
- Finding unique passwords
- Organizing wordlists
Example: Creating a unique wordlist:
sort passwords.txt | uniq > unique_passwords.txt
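As an aside, `sort -u` combines the sorting and deduplication into a single command, which is handy for wordlist cleanup (the sample lines here are just placeholders):

```shell
# These two pipelines produce identical output:
# each distinct line exactly once, in sorted order.
printf 'b\na\nb\nc\n' | sort | uniq
printf 'b\na\nb\nc\n' | sort -u
# a
# b
# c
```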
4. Finding Unique Users or IPs
During enumeration, you might need to:
- Find unique users in logs
- Identify unique IP addresses
- Extract unique email addresses
Example: Finding users that appear only once in a log:
cat auth.log | awk '{print $9}' | sort | uniq -u
5. Data Analysis
When analyzing data dumps:
- Finding unique entries
- Identifying patterns
- Filtering duplicates
Example: Finding unique entries in a database dump:
cat dump.txt | sort | uniq -u
6. Combining with Other Tools
Real-world analysis often combines multiple tools:
Example: Finding unique failed login attempts:
cat auth.log | grep "Failed password" | awk '{print $11}' | sort | uniq -u
The skill you're learning: How to efficiently process and analyze text data. This is essential when:
- Analyzing log files
- Processing data dumps
- Finding unique entries
- Filtering duplicates
- Identifying patterns in text data
- Working with large files
Common Mistakes
Mistake 1: Using uniq Without sort
Wrong:
cat data.txt | uniq -c
# Won't work! Duplicates aren't adjacent
Right:
cat data.txt | sort | uniq -c
# Sort first, then uniq
Why: uniq only works on adjacent duplicates. Without sorting, duplicates are scattered throughout the file, so uniq won't catch them.
Mistake 2: Wrong grep Pattern
Wrong:
cat data.txt | sort | uniq -c | grep "10"
# Shows lines with "10", not lines without "10"!
Right:
cat data.txt | sort | uniq -c | grep -v "10"
# The -v inverts the match
Why: We want lines that DON'T contain "10" (the unique line with count 1). The -v flag inverts the match.
Mistake 3: Forgetting the Pipe
Wrong:
cat data.txt sort uniq -c
# Syntax error - missing pipes
Right:
cat data.txt | sort | uniq -c
# Pipes connect the commands
Why: The pipe (|) is required to connect commands. Without it, you're passing arguments incorrectly.
Mistake 4: Reading the Wrong Output
Confusion: "I see multiple lines in the outputโwhich one is the password?"
Clarification:
- If `grep -v "10"` returns multiple lines, those are all unique (each appears once)
- For Level 9, there should be exactly one unique line
- If you see multiple lines, double-check your command
For Level 9: The output should be exactly one line with count 1; that's your password.
Mistake 5: Not Understanding How uniq -c Works
Confusion: "Why does uniq -c show counts?"
Clarification:
uniq -cshows how many times each line appears- Most lines will show count
10(they appear 10 times) - One line will show count
1(it appears once) - That's the unique lineโyour password
Example output:
10 CV1DtqXWVFXTvM2F0k09SHz0YwRINYA9
10 UsvVyFSfZZWbi6wgC7dAFyFuR6jQQUhR
1 tRUYK7jIDDON0p5I4gRMbau3Ma2sM8z8
The line with count 1 is your password.
Practice Exercise
Try these to reinforce what you learned:
1. Create a test file with duplicates:
   echo -e "apple\nbanana\napple\ncherry\nbanana\nunique_line" > test.txt
2. View the file:
   cat test.txt
3. Sort it (notice duplicates are now adjacent):
   sort test.txt
4. Find unique lines (should show cherry and unique_line):
   sort test.txt | uniq -u
5. Show the count of each line:
   sort test.txt | uniq -c
6. Filter for lines with count 1:
   sort test.txt | uniq -c | grep "^\s*1\s"
7. Clean up:
   rm test.txt
Understanding uniq Options
This is a good time to understand all uniq options:
uniq (Default)
Removes adjacent duplicates, keeping one copy:
sort file.txt | uniq
Example:
Input: apple
apple
banana
banana
Output: apple
banana
uniq -u (Unique Only)
Shows only lines that appear exactly once:
sort file.txt | uniq -u
Example:
Input: apple
apple
banana
cherry
Output: cherry
uniq -d (Duplicates Only)
Shows only lines that appear more than once:
sort file.txt | uniq -d
Example:
Input: apple
apple
banana
cherry
Output: apple
uniq -c (Count)
Shows count of each line:
sort file.txt | uniq -c
Example:
Input: apple
apple
banana
cherry
Output: 2 apple
1 banana
1 cherry
For Level 9: We use uniq -c to see counts, then filter with grep -v "10" to find the line with count 1.
What's Next?
Level 10 introduces base64 encoding, a common way to encode data. You'll learn how to decode base64-encoded strings to reveal hidden passwords.
Before moving on, make sure you:
- Successfully used `sort` to organize file contents
- Understand how `uniq -c` shows line counts
- Know how `grep -v` filters output
- Can combine `sort`, `uniq`, and `grep` to solve problems
- Understand why sorting is necessary before using `uniq`
Key Takeaways
After completing Level 9, you should understand:
- `sort` command - Organizes lines alphabetically, grouping duplicates together
- `uniq` command - Filters duplicate adjacent lines
- `uniq -c` - Shows the count of each line
- `grep -v` - Inverts the match (shows lines that DON'T match)
- Command chaining - Combining simple commands to solve complex problems
Quick Reference
| Problem | Solution | Example |
|---|---|---|
| Sort file contents | Use `sort` | `sort file.txt` |
| Find unique lines | Use `sort \| uniq -u` | `sort file.txt \| uniq -u` |
| Show line counts | Use `sort \| uniq -c` | `sort file.txt \| uniq -c` |
| Filter output | Use `grep -v` | `grep -v "pattern"` |
| Chain commands | Use the pipe `\|` | `cmd1 \| cmd2 \| cmd3` |
Questions about Level 9 or using sort and uniq? Reach out directly:
- Email: m1k3@msquarellc.net
- Phone: (559) 670-3159
- Schedule: Book a free consultation
M Square LLC
Cybersecurity | Penetration Testing | No-Nonsense Advice