Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing apostrophe in street name causes locality garbage deduction #228

Open
bstratto opened this issue Mar 1, 2021 · 1 comment
Open

Comments

@bstratto
Copy link

bstratto commented Mar 1, 2021

The absence of an apostrophe in a street name makes the Geocoder see the remaining street name as locality garbage.
A ballpark estimate for HLTH addresses, scoring below 90 falling in this scenario is 500 – 1500.

Example 1:
addressString: 202 6 1483 LAMEY S MILL R, VANCOUVER, BC
Geocodes to: 202 Sixth Ave, New Westminster, BC
Score: 90
Faults: [LOCALITY_GARBAGE.notAllowed:3, LOCALITY.isAlias:1, STREET_TYPE.missing:6]

Example 2:
addressString: BROADWAY LODGE 1377 LAMEY S MILL RD, VANCOUVER, BC
Geocodes to: 1377 Lamey's Mill Rd, Vancouver, BC
Score: 79
Faults: [INITIAL_GARBAGE.notAllowed:3, LOCALITY_GARBAGE.notAllowed:3, STREET_NAME.spelledWrong:2, STREET_TYPE.missing:6, STREET_NAME.partialMatch:1, FAULTS.tooMany:6]

Example 3:
addressString: 811 1483 LAMEY S MILL RD, VANCOUVER, BC
Geocodes to: UNIT 811 -- 1483 Lamey's Mill Rd, Vancouver, BC
Score: 81
Faults: [LOCALITY_GARBAGE.notAllowed:3, UNIT_DESIGNATOR.missing:0, UNIT_NUMBER.notMatched:1, STREET_NAME.spelledWrong:2, STREET_TYPE.missing:6, STREET_NAME.partialMatch:1, FAULTS.tooMany:6]

Example 4:
addressString: 412 SMUGGLER S COVE RD, BOWEN ISLAND, BC
Geocodes to: 412 Smuggler's Cove Rd, Bowen Island, BC
Score: 88
Faults: [LOCALITY_GARBAGE.notAllowed:3, STREET_NAME.spelledWrong:2, STREET_TYPE.missing:6, STREET_NAME.partialMatch:1]

@cmhodgson
Copy link
Collaborator

While I agree this is an interesting issue to address, I'd like to point out that it isn't the missing apostrophe, as apostrophes are ignored, it is the fact that it was replaced with a space, and the spaces are meaningful. Solving this problem would be similar to solving any "misplaced additional space character" type of mispelling/typo correction.

However, the pattern of a singular "s" character following another word could prompt an alternative interpretation of it being attached to the word (especially if there is no valid match to it as the "south" directional it would otherwise be interpreted as).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants