New 42-day free trial
Smarty

Pros and cons of the fuzzy address matching algorithm

John Hickey
John Hickey
 | 
January 17, 2023
Tags
Fuzzy address matching head image

Typos, typoes or typo's?

No matter how you spell the word, typos are a problem. Even the most accurate typist occasionally misspells a word, leaves off a letter at the end, or just types a different word than intended.

Typos in a text to your friend aren't a big deal. They'll figure out what you mean using context clues. But in a business data scenario, if you're following the logic of traditional address matching, where an address is either an exact match or not a match, you can't account for different formats or misspellings.

In this blog, we'll cover the following:

Exact address-matching methods won't catch common mistakes like

  • Spelling mistakes
  • Incorrect type (street vs. road)
  • Abbreviations
  • Synonyms: Floor vs. level -Unit, flat, or apartment
  • Swapped letters
  • Sounds like
  • Out-of-correct-order

However, fuzzy logic address matching excels at handling common address mistakes

Fuzzy matching (also known as approximate string matching) is a technique that helps identify two elements of text, strings, or entries that are approximately similar but not 100% the same.

Traditional address-matching logic is binary. An address either matches exactly, or it doesn't.

Fuzzy logic indicates the degree to which a statement is true, and can help identify errors like the list above. However, there are nuances to fuzzy matching. Being "too fuzzy" creates matches for addresses that deserve to fail, resulting in false positives.

To avoid this, you can choose your tolerance level, or what degree of similarity you'll accept, from the fuzzy matching algorithm. For example, a provider can set a threshold of 0.8 and then test and adjust as needed. The goal is to ensure that minimal false positives get through, while still returning results and allowing for misspellings, errors, and varying formats.

Here at Smarty, we have a multi-stage process that attempts to find a match using a reliable matching engine. If it doesn't find a match, that address is next run through more aggressive matching processes. By only running failed addresses through additional stages, match rates increase with minimal risk of matching addresses that deserve to fail.

Download the 4 Steps to Unlocking Latent Revenue and Driving ROI with Your Address Data ebook

Pros of fuzzy address matching

Fuzzy logic offers several advantages compared to traditional address matching. Let's look at a few of them.

1. Determining the likelihood of a full match

Fuzzy matching predicts the likelihood of a full match. Depending on how high the probability of a match is, you can decide to consider those two addresses a match or adjust your tolerance settings. Your match probability scores are also a good indicator of your data quality.

2. Handling misspellings, formatting errors, and phonetic variations

People often spell addresses based on how the street name sounds. Traditional address matching won't catch that Fontainebleau Drive is the same street as Fountainblue Drive or Fontinblow Drive. Fuzzy logic is much better at handling misspellings and minor differences because it looks at how many characters are different.

Without the ability to easily pair up the same address, written in two different ways, it's harder to deduplicate address data and merge your records.

3. Distinguishing unique address information

You can program your exact matches to accept some small, common variances in standard formatting (i.e., street vs. st, road vs. Rd, etc.). However, that won't help with non-standard address information or street names. You can't sit there and program in ten different spellings of Fontainebleau! Fuzzy logic allows you to set rules for address standardization, correcting address information like street names to match with greater accuracy.

Cons of fuzzy address matching

Nobody's perfect, and neither are fuzzy matching algorithms. Depending on your tolerance level, you might not catch the craziest misspellings or wacky formatting errors. However, fuzzy address matching addresses many weaknesses demonstrated by traditional address matching.

Powered by fuzzy address matching, better match rates can increase your revenue by providing more usable address data and minimizing time spent on manual validation.

Want to know more about how accurate address data can help your business? Download our free ebook, 4 Steps to Unlocking Latent Revenue and Driving ROI with Your Address Data.

Download the free ebook today

Download the 4 Steps to Unlocking Latent Revenue and Driving ROI with Your Address Data ebook

Subscribe to our blog!
Learn more about RSS feeds here.
rss feed icon
Subscribe Now
Read our recent posts
Improving user/customer experience in every industry with clean address data
Arrow Icon
You finally track down an essential addition to your collector’s set of [insert item of your choice], and you're hyped to buy it until the chaos begins. The cart is hidden in a fly-out on the side, cluttered with blocky, overwhelming text. You spend way too long just trying to find the "Proceed to Checkout" button. 👎 That’s bad UI (user interface): messy, confusing design that makes navigation a chore. You make it to the checkout and start entering your info, but the site keeps rejecting your address.
Dashboard essentials for Smarty users
Arrow Icon
The Smarty dashboard is your central hub for managing address verification, geocoding, and property data services. Whether you're just starting or looking to optimize your current setup, understanding the dashboard's full capabilities can significantly streamline your address data operations. We recently held a webinar in which we reviewed all of the Smarty dashboard's items and features. Missed it? That's OK; we've got all the information right here. You can expect to read about:Accessing your dashboardSetting up your account for successUnderstanding your active subscriptionsManaging API keys effectivelyStreamlining billing and financial managementStaying informed with smart notificationsTeam management and access controlsWeb toolsMaking the most of free trialsKey takeawaysLet’s get going!Accessing your dashboardGetting to your dashboard is straightforward.
Take charge of your API usage with Smarty’s key management features
Arrow Icon
Ever wondered, “Where did all my lookups go?!” Without proper API management, you may burn through your lookups quicker, experience runaway code, and encounter unexpected usage. That’s why Smarty created usage by key (included in all annual plans) and limit by key (included in some plans; you can add them by contacting sales) for its APIs. Why key management mattersCommon API usage challenges (problems to solve):Unexpected spikes in lookupsDifficulty tracking specific key usageWhich keys are calling which Smarty licenseNeed for better control over API consumptionDifficulty allocating Smarty lookups across an organizationWith Smarty's key management features, you gain more control by having better visibility of your usage, eliminating the element of surprise (you know, the bad kind, like when you’re suddenly out of lookups).

Ready to get started?