logo separator

[mkgmap-dev] How can we use prefix/suffix feature in road names?

From Gerd Petermann GPetermann_muenchen at hotmail.com on Thu Apr 13 15:35:42 BST 2017

Hi all,

I've compiled two lists using this method:
1) Collect all names in OSM ways with highway=residential from europe.osm.pbf dated 2017-04-04. I've used this commands:
osmfilter europe.o5m --ignore-dependencies --keep= --keep-ways="highway=residential" -o=residential_roads.o5m

I've created a small java program based on splitter to
2) Collect those names with at least one blank or apostroph ( ' )
3) For each name: Find position of blank or apostrop, create prefix as substring from start to position and suffix as substring from position  to end. Set position to next blank/apostrop. Stop if none found.
Each calculated prefix / suffix is counted.
Example : "Chemin de Piere Froide" gives
prefixes "Chemin", "Chemin de", "Chemin de Piere" and
suffixes "de Piere Froide", "Piere Froide", and "Froide"

I've printed those strings with > 1000 occurences sorted by highest occurence first.

It seems to me that we need some language experts to sort out which of the strings are useful prefix / suffix strings.
I am pretty sure that the "prefix"
"Rue Jean"     11476
is not a good candidate but others with smaller numbers are okay, e.g.
"Rua de"     5324
....
"Route du"     3329

The suffixes with multiple words are probably not useful, at least not in those languages that I know a little bit.
Note that my algo is case sensitive.

Maybe we can use these lists to set up a list of prefixes and suffixes ?
I am now compiling those lists for a planet file from 2017-01-05.

Gerd



________________________________________
Von: mkgmap-dev <mkgmap-dev-bounces at lists.mkgmap.org.uk> im Auftrag von Carlos Dávila <cdavilam at orangecorreo.es>
Gesendet: Mittwoch, 12. April 2017 23:08:45
An: Development list for mkgmap
Betreff: Re: [mkgmap-dev] How can we use prefix/suffix feature in road names?

El 12/04/17 a las 21:48, Steve Ratcliffe escribió:
>
> Hi
>
>> There is at least one visible effect of these 0x1e and 0x1f
>> characters: When you zoom out MapSource removes the prefix / suffix
>> part(s) from
>
> In addition there is 0x1b which is like 0x1e, except that it does not
> act as a space.  Only known example is is for "L'" being used as a
> prefix:
>
>   Rue de L'Abbe Vincent
There's also D': Rue D'Aberdeen, Allée D'Albert...
>
> There is also 0x1c which is the non-spacing equivalent of 0x1f, I
> don't know of any examples of that being used in street names.
>
> Since 0x1e and 0x1f are effectively spaces, I've created a patch to
> make them sort along with and just after space.
>
> ..Steve
_______________________________________________
mkgmap-dev mailing list
mkgmap-dev at lists.mkgmap.org.uk
http://www.mkgmap.org.uk/mailman/listinfo/mkgmap-dev
-------------- next part --------------
A non-text attachment was scrubbed...
Name: suffix-prefix-europe.zip
Type: application/x-zip-compressed
Size: 15016 bytes
Desc: suffix-prefix-europe.zip
URL: <http://www.mkgmap.org.uk/pipermail/mkgmap-dev/attachments/20170413/647acfdf/attachment.bin>


More information about the mkgmap-dev mailing list