IWETHEY v. 0.3.0 | TODO
1,095 registered users | 0 active users | 0 LpH | Statistics
Login | Create New User
IWETHEY Banner

Welcome to IWETHEY!

New I think #2 does what I did
I only used geocoding because it was free for me and I had code for it. I was using it to turn an address into a location, and using that to match addresses. But if you can standardize addresses another way, go for it.

For my project I was able to ignore the name issue - I was just merging several lists into one, and needed to spit out "these aren't duplicated" and "these may be, look them over by eye". So it was only semi-automated.

Cheers,
Ben
I have come to believe that idealism without discipline is a quick road to disaster, while discipline without idealism is pointless. -- Aaron Ward (my brother)
New Your geocoding process standardized first
When we geocode stuff, 1st step is to standardize.
They just did it for you under the covers.
New Exactly
Come to think of it, I really did need the geocoding.

There is a problem with apartment buildings where on one lot you have 2 buildings with different addresses, but they're really the same. So I counted as possible dupes to review any two that were close. (I used "within adjacent 100' by 100' boxes, so that range was 100-300 feet.) That I couldn't have done without geocoding, but it was only a nice to have anyways. And it obviously doesn't apply in this case.

Cheers,
Ben
I have come to believe that idealism without discipline is a quick road to disaster, while discipline without idealism is pointless. -- Aaron Ward (my brother)
Expand Edited by ben_tilly Jan. 18, 2006, 09:28:16 PM EST
     Postal address list cleansing - (Steve Lowe) - (21)
         I just did a deduping project kind of like that - (ben_tilly) - (1)
             Thanks for the tip! - (Steve Lowe)
         do a dump then sort by address. the dupes are identified - (boxley) - (4)
             YM, delete HALF of them... Or NOBODY at that address is left -NT - (CRConrad)
             2 things - (ben_tilly) - (2)
                 so going by lat and long makes more sense, thx -NT - (boxley) - (1)
                     There are apps that standardize addresses - (drewk)
         Here ya go - (broomberg) - (12)
             I think #2 does what I did - (ben_tilly) - (2)
                 Your geocoding process standardized first - (broomberg) - (1)
                     Exactly - (ben_tilly)
             Re: Here ya go - (Steve Lowe) - (8)
                 ObLRPD: "Vote him off the island!" - (Another Scott) - (3)
                     That'd be harder than giving a bath to a bobcat. -NT - (admin) - (2)
                         the trick is... - (cforde) - (1)
                             Talk your talk, wee man. -NT - (admin)
                 Firstlogic match/consolidate is verra nice - (broomberg) - (3)
                     Thanks, having a look. -NT - (Steve Lowe)
                     Re "How much is the cost to mail each duplicate each month?" - (CRConrad) - (1)
                         Exactly - (Steve Lowe)
         Send me a dump of the list in e-mail. - (folkert)

That is The Story.

The rest is just pretty pictures.
125 ms