Fox Valley Microsoft Data Platform

 Welcome to FoxPASS, the SQL Server community for Northeast Wisconsin! 

Next Meeting: Wed, Sep 05 2018

When Low-Quality Data Strikes with Jared Kuehn

Language: English
Event Type: In-Person & Online
Online Meeting URL: https://meet.lync.com/talavant/turner.kunkel/93K6T64D?sl=1
RSVPURL: https://www.meetup.com/Fox-Valley-Microsoft-Data-Platform/events/253604024/

Please join us for Jared Kuehn's presentation on When Low-Quality Data Strikes: Fuzzy Tools Provide Clarity in Matching and Deduplication.


A little bit about Jared:

"Jared has over 5 years of experience working in the business intelligence field. He has had the pleasure of working in varied industries including higher education, insurance, healthcare, and retail. While he is newer to presenting on technology, he enjoys spending much of his free time presenting himself in theater productions and music groups throughout the Fox Valley and Milwaukee."

 

See you there!

When

iCal
Event Time : Wed, Sep 05 2018 17:30 - 19:30 Central Daylight Time
Your Local Time: Wed, Sep 05 2018 22:30 - 00:30

Where

4351 W College Ave
Appleton, Wisconsin

Direction: Suite 210 (Second Floor) (Follow signs)

Featured Presentation

When Low-Quality Data Strikes: Fuzzy Tools Provide Clarity in Matching and Deduplication

Speaker: Jared Kuehn, Data Analytics Engineer Skyline Technologies

Summary: You have a high-quality dataset, appropriately keyed, groomed, and trusted by business users. Then you're asked to merge in a new, low-quality dataset. It may contain a different key structure, numerous text fields with typos, or optional fields that are empty on most records. How would you find as many accurate matches as possible? You can define multiple matching algorithms to handle the various discrepancies you find, but it can be difficult and time consuming to prevent missing matches. In this session, I will showcase how you can solve problems like this using the fuzzy tools natively available in SQL Server. I will explain how a fuzzy approach compares to other options such as exact match algorithms, weighing the pros and cons. Finally, I will demonstrate how to set up the groundwork to incorporate fuzzy tools into a data flow solution. By the end of this session, you should have another tool in your toolbelt that can aid you in any matching or data deduplication scenario.

About Jared: Jared has worked in the Business Intelligence industry for over 5 years, primarily in warehouse development and reporting solutions. He has acquired a MCSE certification in Business Intelligence and continues to do research in the realm of fuzzy logic. He's served in multiple industries including Healthcare, Higher Education, Insurance and Retail. He firmly believes in not only learning new technologies, but sharing knowledge with one another to grow together. In his spare time, he enjoys spending his time performing in musical groups and theater productions, nerding out with video games, or spending time with his family.

Thank you sponsors! 


 

 

 

PASSChapterLogo100.jpg 

 

 

sql_micro_sm.gif 

sql_ca_sm.gif 

 

 

Brent Ozar Unlimited

Back to Top
cage-aids
cage-aids
cage-aids
cage-aids