Replies: 3 comments 5 replies
-
Even if you have the ability to represent multiple language versions of a key value, there is no guaranty that an editor (a human using an editing software) updates all the languages variants even if you show them to him and ask to review. Why? Firstly the is only interested in a limited set of language variants (say his native language and maybe English in addition), secondly he/she/it has limited time and a set of information to insert into the database, for example notes he made for himself on the ground. The data model could allow the associate multiple values with an key, say 7000 to match every language used by humans nowadays plus multiple local and regional variants (it is not guarantied that every object has only one local name). For editors it could be useful if locations have multiple names (such as many villages and towns in the German region of Nordfriesland where locations have often multiple names, one in the local dialect of the native Frisian language and one in the prevailing language). But the same effect can be achieved if editing software offers suitable interfaces for that. The mapping of multiple values to one key is possible, but it needs an additional indirection which make the software logic on the server side and on the client side more complicated and breaks many software used nowadays to process OpenStreetMap data. If have no statistics about the usage of multilanguage usage in the OSM database, but I assume that only a very small amount of elements have name:LANGUAGE tags or similar tags. |
Beta Was this translation helpful? Give feedback.
-
Borrowing from other translation system, I guess it might be possible to:
- Force values of some key, like name=*, be stored with language tag (Editing software can apply national language default so that when user entering data for like the US users don't need to specify English when they
are typing English name)
- When a user update default language value, flag other language data as outdated until they are being verified or updated by another user.
- Editing software show transliteration of non-latin text in latin characters like what wikidata's editor is doing, to allow easy verification and deletion of inappropriate or outdated data
|
Beta Was this translation helpful? Give feedback.
-
I think that at least for some object (countries, provinces, states, counties, regions, municipalities, cities, towns and villages) the names should be stored on Wikidata and fetched as needed (or on a regular basis). Just to give an example, the relation for the USA has 333 name tags (including short names, official names and old names), England has 227 names, Germany has 306 names. It's pretty much impossible to maintain such amount of data. Wikidata most certainly has a lot more names in diffrent languages and I see no point in duplicating this data. Even if it's not possible to do drop all of these names from OSM, there can be a bot which watches the Wikidata entries and updates OSM on regular basis. |
Beta Was this translation helpful? Give feedback.
-
Currently, most multilingual data are stored wiht data for each languages of each tag entered as its own individual tag. For example name:en=, name:ja=, and so on, and that's not just name but also other tags where local language information and/or their translation of the object is expected. However it is not exactly easy to identify, translate, store, and present these data to people who use multiple languages in the current system. It also cannot avoid the problem of same language data being not up to date if multiple language tag data are entered into OSM database. How can the data model be improved to deal with this?
Beta Was this translation helpful? Give feedback.
All reactions