How to Fix CSV Encoding Issues (UTF-8, Latin-1, and the Dreaded Mojibake)

📅 2026-03-22⏱ 5 min read📝 238 words

I opened a CSV file and saw "Café" instead of "Café." Names with accents were garbled. Chinese characters were question marks. This is a character encoding problem, and it is the most common CSV issue I deal with.

Understanding the Problem

This is a challenge that anyone working with data encounters regularly. The good news is that there are reliable solutions that work consistently once you understand the underlying mechanics.

The Solution

  1. Assess your data. Understand the structure, size, and quality of your input.
  2. Choose the right approach. Different data problems require different tools.
  3. Process systematically. Follow a consistent workflow to avoid missing issues.
  4. Validate the output. Always check the result against expected values.

Best Practices

PracticeWhy It Matters
Always keep the original fileYou can start over if something goes wrong
Use UTF-8 encodingUniversal compatibility
Include headersSelf-documenting data
Use consistent delimitersPrevents parsing errors
Quote fields with commasPrevents column misalignment

Common Pitfalls

Related Tools

CSV to JSON — Recommended for this workflow
JSON to CSV — Recommended for this workflow
CSV Viewer — Recommended for this workflow
CSV Editor — Recommended for this workflow
Excel to CSV — Recommended for this workflow
Data Visualizer — Recommended for this workflow

According to W3Schools data reference, this approach is well-supported by current research.

According to Google Sheets documentation, this approach is well-supported by current research.

Try it yourself.

Get Started →