 Hello, okay, is everybody here my voices and I'm from Taiwan. I'm Wikipedia Taiwan Dennis Chen today's my talk is this is also a we did a Topic, but it is more dealing with the regional Taiwan data set So it was moved to the except track My title is from cross-linking middle-lingual articles on database entries How does we did our link to the whole world's knowledge the Taiwan experience? Right. I'm Here's my online ID. I'm superplex. I'm an open-stream map contributor. No, so we did a contributor and I'm also one of the board member of the working media Taiwan Okay, and during my community time contributing time I'm fighting Vandalism this case is some might be some Chinese people That change all the ZH verities to the simplified Chinese verities this is a one of the reality show from Korea and Chinese is a call Tinen to den by Ren Da挑戰 if they all change the simplified Chinese version for all the various like ZHT dash TW ZH anti and ZH and Also on the open-stream map. I'm also Active contributor and fighting vandalism. This is example also from China that they are drawing a unrealistic bridge between the China and Taiwan that is not possible in the near future due to is tied the This water body between Taiwan and China the Taiwan Strait. It is quite the not a good Construction site that is not possible for tunnel or bridges Okay There's an important questions that is anyone doesn't know wiki data here So I assume everybody in this room knows about wiki data so and of course Wikipedia then Why should we pay attention with data is not easy to understand not easy to contributions But it has some features like it's more structured in wikipedia because wikipedia is a human readable form It's written in article forms. So if you want to let Program the computer knows what's this? rich set rich form of the knowledge usually have to be a Matching readable and We also have some cross language abilities that it is stored language neutral form and then using label to notes With the cut back in English. What should it be called in Chinese? What it should be called and then both could be Covering language international language and regional language And a simple words to describe wiki data I will say it is the database of database the index other third party Entries like it can store open stream at region ID way ID No IDs on the wiki data platform and also from some government Desserts at Taiwan we have the school code and the river code that is maintained by Taiwanese government Okay, here's some quick step of the history of wiki data in Taiwan and 2013 before the wiki manai in Hong Kong We have the honor to invite Lydia to come to Taiwan to have a talk and one of the biggest open source conference in Taiwan Coast Club but between 2014 and 2019 is Not active the Taiwanese community have not such interests in the wiki data contributions and Okay, I miss bill something There's an academic a scholar called mr. Drone Ding Rui He has some research about wiki data and She has some interest wiki data even so found that he come to the first wiki.com and birding and 2019 is has some thing changed the local Chinese community Have more interest of wiki data and start to have some a massive contribution on the platform like we started in the imported Laws the Taiwanese laws village school library Episodial dramas in Taiwan the government publications and Research papers and we also start the open shimak ex wiki data monthly meetup in Taiwan it's this is holding Taipei and if you know that the open shimak and wiki data is quite of the Very very close together. So we have in Taipei. We have a community made up together each month Okay, here's a quick example for wiki data This is a community venue that we hold the monthly meetup in Taipei is more space Taipei that we created a wiki data items for this space and It's an instance of venue and we have the office site Website and also no ID. It is all state all written down on wiki data platform Okay for the with data on maps we Everybody should know that that the map behind the wiki data articles are the customized version of a branching map that use the open shimak data to to Render to fix the need of wiki pdr's or who displays and here are some example that the This is advanced data type type called Relations that we can store something like village reverse schools and Train metro library station Okay, here's some promotion information. We have the wiki.com this year in Taiwan That was a hold in October 28 and 29 We will have a local track and also an international track They will all both will be streaming online and the international track is European US time zone friendly They hope everyone can join not only on site, but also visually Okay, this is the articles my review of wiki data and fluorescence you might saw this articles and one of the pictures that have It deploys those We can add the items with coordination's that in between 2021 and 2023 It's a quite a big difference between these two years. We have some masses edited about Items with we adding so much Coordination to various types of things like the rich or schools that make a very very difference between these two years that is we can get Make a Jeff Peacher to display the difference Okay, here's a quick jump. Everybody knows this Is that this 10 years ago and every year and the end of October we will celebrate wiki data birthday And of course and wiki.com this year in Taipei We will also have the onsite celebration sessions and also the visual platform we use will also have the Yeah, the celebrations for the birthday of wiki data and here's a quick summarize about the relation between the Wiki data and the other various wiki media projects that we all know is that wiki pdr has a mutual a lingo mutated Mutated media that is stored on all on wiki comments and for structural data on wiki day pdr we can transform it into a form that it can store on wiki data and Okay, this is this is a quick Statics statistics about it's over one million. I think it's one billion No, no, not one billion. It's a 100 million items on wiki data and the total Volume of the data set if you want to download it is over 100 GB Okay, we even have some language contribution But if you're familiar wiki data, we have wiki a lexamen that this is a regional language is called Taiwanese Down is taking in Taiwan that we have this This is the first personal single form war wiki data lexamen Okay, here's a quick I Have you know seven minutes left? So I will speed up my talk Okay, this is a vintage project. We go on in Taiwan That on we data and also open stream map. We start to draw in map boundaries And also created meta data on wiki data. So in front 2019 it's been over four years on open stream map and Because this is quite hard to process. So we use in a semi semi Import to open stream map not force import and The similar case is from Philippines that they serve that he start to Drawing barangang on open stream and wiki data And if you have some knowledge of Taiwan, we have a quite a huge number of villages over 7,000 near 8,000s village in Taiwan we think it The household register ID from the Taiwanese government also OSM region ID all stored on the wiki data and vice-verses We store the wiki data item ID of stream app So it is it is a cross thing between each other Okay, here's a visualizations on open stream map that the totally near 8,000s village all on this Visualized Website overpass turbo and this is on we doing wiki data query research search query This is the results that we can see the central point of each village in near the Taipei city Okay, this is I already said this one skip this okay, but we have some arrows because we are humans and each humans had different opinion and different ways of doing things so we have something and The most major error is some people importing village to open wiki data use the old data set It is it's only maintained until 2018 and we're doing the import in 2019. So we missing some Merged or newly established village in the Thailand city that they were doing it in 2018 So the solution is we use a new data set that is maintained by the other Taiwanese government agencies The other issue is this one if you're familiar with some Taiwanese language schema This is the town in the key that already established some villages on the ZH mean none Wikipedia and unfortunately these village on the ZH mean none Wikipedia They were mass imported into the wiki data, but without statements So we couldn't use normal wiki data query to find out these Items on the wiki data. So I'm missing these items that we didn't merge with the newly imported village entries other area is There are some Local projects going on to establish village in Taiwan like Zhanghua County and Jiayi County They have created wikipedia articles But they didn't notice that wiki data exists, but they didn't cross-link it on the wiki data items They might have the chance that someone mass imported to establish new wiki data items So this is other area, but compared to the previous one is much more small scale Okay, so we have some one with the program abilities to cross-link using monitor the government data sets and If there's a new village established or some village was removed, disbanded We can use this website to monitor the situations Okay, this one Okay, we also have a project that is cross-linking The river managed by the Taiwanese government that each have a unique ID and also some village rivers already written on Chinese wikipedia And also on the open stream map. We also have to draw the river into a special form that is called open stream map relation Okay, here's some problem that might if you're a long time wikipedia Contributions that the Chibis wikipedia has using some open data data sets to mass import mass establish those wikipedia articles is the the idea of this both accounted at LJS both that have masses created all Rivers, I think is some rivers townships masses on Chibis wikipedia And it will affect it also includes some Taiwanese rivers and town Okay, they use the genus data set to masses in Creative articles and then someone masses you created on wikidata Okay, here are some quick view of the rivers in Taiwan. We have done that the Cross-linking we created with our items linking to wikipedia articles linking to open stream map in relation ID And also some has some Joe coordination's like the mouth of the rivers or creeks Okay, here. I think I have to hurry up Okay, a few This is a project that is cooperation between open stream map wikipedia community they use this project name suggestion index to Standardize some brand chance tags on both on the wikipedia open stream map That use it is announced on standard map us 2019 it once is a pet Pet project, but now it's more have some heavily in full involvement and investment for this project and We have some someone created someone has maintained and some Change stores in Taiwan using the name suggestion index projects Okay, this is a quick overview that We there has open shima has used quite a lot with data items that we use some secondary key value pair on open stream map to maintain some data set on open stream map and also this is a Okay statues that we can use also link in the wikidata using the The secondary tag subject column wikidata to do yeah To notice that Chiang Kai-shek status or something else or the kung fu shi status Okay, or even can trace the entomology of some stuff back. There are so many Zhong Zheng road in Taiwan so you can you Can link it to wikidata adding this secondary open shima tag All right, there's schools of visualizations Well a quick overview about it We have the full set of the wikidata schools on wikidata And we are trying very hard to link it to open stream map and vice-versa open stream map to wikidata And we have a project page to trace how to tag how to Add the statement on wikidata Okay, so right now the I think all the schools are on wikidata but Then it is a very hard work because there will be new schools or disbanded school there So we have to keep tracing every year. It was quite time-consuming and we have to read the news articles or Or the government announcement to keep track of it I hear some overview about what's next for the Taiwanese wikidata community that some foreigners that had massive imported ethos and in Taiwan using the Agoda or booking comms ID That all created in the English, but it's making Chinese label Okay, it's finished school. They both have to be up to date. So it's well government he has to Yeah Has to keep an eye on it and also for the purchase sites We all have wikipedia articles links, but unfortunately it is not well Maintained and on wikidata. So we have to work hard to add the correct statements on wikidata and Rivers we have to deal with the Chibis wikipedia dedicated items and For bus and river fans. We have a masses of Portals on wiki comments, but these are wiki comments category doesn't link to wiki doesn't link to the correspondent wiki the items and I think in the most cases we didn't have wikidata items for bus on wikidata So it's possible to add in this missing Okay, this is a conclusion that is a chance to maintenance and quite consume the community's time and yeah And it's more easy to do a mass imported But if you have to do additional edit is quite crime consuming and need careful plan okay, and for the future the community should do more thematic workshop and Try to link into different four party database that will maintain or or not and we're doing some metalingal not only for the International big language both of the Taiwanese Natural language that I don't need diggy. Don't need hot car tiny for Mosa languages Okay. Thank you everybody. I slice over I think I have to hand over to next speaker. Thank you very much