Wikidata:Requests for permissions/Bot/AroundTheBot
- The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Approved. (admin-closure) --Wüstenspringmaus talk 13:00, 12 September 2024 (UTC)[reply]
AroundTheBot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: Hardwigg (talk • contribs • logs) & BrigidGit (talk • contribs • logs)
Task/s: Automated import of Albanian nouns with IPA from Wiktionary, with the long-term goal of using this data to do pronunciation-based comparison/word evolution between languages.
Code: This notebook performs initial kaikki dataset analysis/cleanup. This notebook (run inside PAWS) coerces the cleaned up data to Wikidata format and performs the actual import.
Function details: We worked with the kaikki dataset, a structured parsing of wiktionary, to find relevant Albanian nouns with IPA pronunciation, remove any noisy entries, coerce the words into the lexeme format used by Wikidata, and then import them into Wikidata. --Hardwigg (talk) 12:22, 18 July 2024 (UTC) & @BrigidGit[reply]
- Please make some test edits Ymblanter (talk) 20:22, 28 July 2024 (UTC)[reply]
- Awesome! We will get that done this week. Hardwigg (talk) 23:59, 2 August 2024 (UTC)[reply]
- @Ymblanter Ok we're still working through a few fixes with the script, but should be ready to do a 50-edit test set by next week. You can see the first few automated edits we've been making here: Special:Contributions/AroundTheBot Hardwigg (talk) 10:27, 15 August 2024 (UTC)[reply]
- @Ymblanter Ok, we had to make a few more data cleanup fixes, but now things are running smoothly. The first 50 just completed. Let us know if/when we can proceed with the full import. Hardwigg (talk) 12:47, 12 September 2024 (UTC)[reply]
- Awesome! We will get that done this week. Hardwigg (talk) 23:59, 2 August 2024 (UTC)[reply]