About this board

Previous discussion was archived at User talk:BrokenSegue/Archive 1 on 2022-09-20.





Logo of Wikidata

Welcome to Wikidata, BrokenSegue!

Wikidata is a free knowledge base that you can edit! It can be read and edited by humans and machines alike and you can go to any item page now and add to this ever-growing database!

Need some help getting started? Here are some pages you can familiarize yourself with:

  • Introduction – An introduction to the project.
  • Wikidata tours – Interactive tutorials to show you how Wikidata works.
  • Community portal – The portal for community members.
  • User options – including the 'Babel' extension, to set your language preferences.
  • Contents – The main help page for editing and using the site.
  • Project chat – Discussions about the project.
  • Tools – A collection of user-developed tools to allow for easier completion of some tasks.

Please remember to sign your messages on talk pages by typing four tildes (~~~~); this will automatically insert your username and the date.

If you have any questions, don't hesitate to ask on Project chat. If you want to try out editing, you can use the sandbox to try. Once again, welcome, and I hope you quickly feel comfortable here, and become an active editor for Wikidata.

Best regards!

Danysan1 (talkcontribs)

Hi,

The Psychiq distributed game is broken and suggests to contact you about it: https://wikidata-game.toolforge.org/distributed/#game=84

Thanks for your work

BrokenSegue (talkcontribs)

Thanks for the notice. It ran out of tasks to do. I'll have to find some time to generate some more.

AlphaLemur (talkcontribs)

Hello @BrokenSegue, the Psychic game has run out of tiles in the last week. Hoping you can generate some more for it to do. Thank you maintaining this game - it got me interested in Wikidata.

Reply to "Psychiq distributed game is down"

BorkedBot adding invalid data to {{Q|Q35907}}

3
DeFacto (talkcontribs)

Can the bot be stopped from adding invalid data to Land Rover Ltd. (Q35907), and from restoring content once it has been reverted please? As the very first statement of 'Land Rover Ltd' says, that company stopped manufacturing cars in 2012, and so does not use or control the Land Rover YouTube channel. That channel is owned by Jaguar Land Rover (Q6122893) (JLR), the owners of the Land Rover marque and the manufacturer of Land Rover-branded cars. The more appropriate place for that YouTube channel to be recorded is Land Rover (Q26777551), the entry for the Land Rover marque, in the same way that Jaguar (marque) (Q21170490) is used for the YouTube channel details of JLR's Jaguar brand. Thanks.

BrokenSegue (talkcontribs)

to indicate to the bot that the youtube account is no longer valid you provide an end time on the channel statement.

DeFacto (talkcontribs)

What if it was never valid? I'd suggest that it belongs with the brand (Land Rover (Q26777551)) item, and not with the item of one of the manufacturing companies that produced the brand.

Reply to "BorkedBot adding invalid data to {{Q|Q35907}}"

"interested in" vs "field of work"

2
AVDLCZ (talkcontribs)
BrokenSegue (talkcontribs)

I guess, though I'm not really sure it's more fitting? But it's not worth quibbling about.

Gharouni (talkcontribs)

You have an email. Please check your email.

BrokenSegue (talkcontribs)

done

Gharouni (talkcontribs)

Thanks.

Reply to "email"
ElleAnónime (talkcontribs)

I apologize for the confusion, I made this change because Pride Parade the item of which it's an instance, already has "recurring event" as a statement.

BrokenSegue (talkcontribs)

I see. I don't see why pride parade would always need to be a recurring event.

Reply to "Tbilisi Pride"
Mutaloht (talkcontribs)

I accidentally revealed my IP in Q6863274’s View history(15:44, 2 May), can you please help delete it or hide it?

I'm so sorry for my carelessness, please help me this time.~~~~

BrokenSegue (talkcontribs)

done

Mutaloht (talkcontribs)

Thank you very very very much!

Reply to "Can you do me a favor?"
Infrastruktur (talkcontribs)

I blocked a user you recently had interactions with (User:Leavemealone7) under UCOC 3.1 subsection "threats". Sure, we are not lawyers, but while the foundation enjoys some immunity there is still opportunity to harass editors as evidenced by a somewhat recent legal case.

Trade (talkcontribs)

What recent case?

Infrastruktur (talkcontribs)
Reply to "Note"
Mokadoshi (talkcontribs)

Following up from the Github issue where I said I wanted to discuss the frequency of updates.

I've read Wikidata:Requests for comment/Frequency of YouTube follower count data and I understand the consensus is that we should only update subscriber numbers once per year, or whenever it differs by 10% or more. I'm perfectly fine with this, I agree with the consensus. What I would like to better understand though is how we will deal with manual updates. On enwiki, IP users and other users update these numbers extremely often. I got emailed yesterday about a change that differed by about 0.25%. If these users can't directly update the enwiki article anymore so they're coming here to do this, is that going to cause problems?

I wonder if there is a compromise. For example, what if we stored 2 different values, one that tracks the value over time and one that is simply overwritten with the latest value. Then the latter we can update as often as we want, and this is the value we display on the enwiki article. Or am I misunderstanding the motivations behind the frequency we use today? Mokadoshi (talk) 22:58, 20 February 2024 (UTC)

BrokenSegue (talkcontribs)

Nothing stops users from updating the counts more frequently than the bot. The agreement in the RfC only restricts what BorkedBot does. I don't think the updates on certain popular youtubers happening more often by random users is a problem. I'm more worried that people inexperienced with Wikidata will not know how to update this information (it requires a few steps that aren't obvious to newbies). That's why I think someone (I would love wikimedia germany to do this) should make a "friendly" UI geared to making specific kinds of edits (e.g. add a new subscription count) in a simpler way.

The "2 value" idea you suggest was previously discussed. Wikidata generally has a bias against deleting old (but true) values. But there isn't much precedent here for the kinds of time series data we're considering. But I'm not sure the idea fully resolves the problem. We want to limit the number of values stored but also not crowd the history too much. I'm wondering what update schedule you would want? Weekly? Every 5%? One thing we could consider is having a list of "prominent" youtubers whose values we want to update more frequently.

A bot to compare ourselves to is User:Github-wiki-bot (whose output is also being used in enwiki) which has made hundreds of edits to e.g. webpack (Q22909730) and does so even for items without wiki articles to consume the info.

Mokadoshi (talkcontribs)

I'm personally perfectly fine with the current implementation of: 10%/60 days, checked weekly. It's just that I know this will be too slow for some editors, I've already seen talks that some people would like these to be updated no slower than once per month. Also, the number has to be updated quickly after reaching a major milestone, as an announcement video or tweet will probably cause people to check the article.

So, what do you think about this proposal?

First, we keep the 10%/60 day threshold with the weekly checks, but we add in a daily cron to check the channels that are close to a new milestone. The RFC already allows us to do an out-of-band update for crossing milestones. This fixes the problem of people updating to match an announcement, but doesn't fix the problem of data generally being around 60 days old.

Second, I like your idea of the "prominent" youtubers. I don't know yet who this list would be, I think we can play it by ear. These could be updated each month or so, assuming we are still covering the milestones as said above. I'm guessing we'd need to do another RFP for this, but we can cross that bridge once we get to it.

BrokenSegue (talkcontribs)

Yeah I mean we could do another RfC for this but the github wiki bot had no such process and it spams way more than BorkedBot. I'm just trying to be a good citizen. We could also go the "ask forgiveness" approach.

What counts as a milestone? I presume 1 million. Is 200k a milestone? Just the levels youtube hands out awards?

As for identifying "prominent youtubers". We could just make a wiki page somewhere on enwiki and let people manually list youtubers to get more frequent updates? It's akin to having the "update now" website but more async and documented.

Once per month seems a bit much to me personally. Many youtubers are stagnant for years. If we want this then we would have to do something like the "2 value" idea you mentioned.

Mokadoshi (talkcontribs)

My goal really is just to get ahead of editors feeling like they need to make manual edits, because that is where we run into problems like: how do we teach them how to do this? do we need to make tooling to make it easier for them? etc. It is my opinion that a major way we can get ahead of this is to be proactive in updating for crossing milestones (defined in the RFC as 100k, 1M, 10M, etc). So my suggestion is that if someone is at 99k or similar, we should be checking each day to be the first to update the value when they break 100k so another editor doesn't have the opportunity to fuss with manual edits to do it themselves.

Re: the "prominent youtubers" thing, I think that's a good idea. I think we should implement code to support this but leave it to the WikiProject to deal with figuring out who should be in that list.

Re: once per month, I understand your concern. I know some people expressed interest in once per month, so it would be a shame to put in work to get this adopted just to have people vote against it because of this problem. But maybe we should just cross that bridge when we get to it: if this is really a critical problem for people then it wouldn't be hard to change the code later.

So to summarize, if you agree about the daily checking for milestone crossings, then we can talk on Github about implementing this. The "prominent youtubers" thing I think we should also write an implementation for, but this doesn't need to be done now.

BrokenSegue (talkcontribs)

to implement milestones we just need to run the whole script daily. while it would be optimal to do the "check what's near" logic I think it's just easier to recheck them all daily. I think it only takes 1h to run anyways and compute is free (to me). I can do that.

The prominent yt thing requires new work but isn't hard. We could also use a hidden wikipedia category for this logic? Or maybe a template param? I'm indifferent.

Yeah I think optimally we would have a custom UI where it asks you for the new sub count and it writes to wikidata for you. In a perfect world this would be reusable for lots of cases where updating wikidata by inexperienced users would be wanted. Realistically that isn't going to happen though (unless you want a project to do?) and might even require collaboration with WMF-Germany (though I think they would be game, I hear they are interested in this direction maybe). Otherwise maybe we could just record a video explaining how to do it?

Reply to "BorkedBot - YT update frequency"

Brazilian Portuguese qualifier in Fandom

2
MikutoH (talkcontribs)
BrokenSegue (talkcontribs)
Reply to "Brazilian Portuguese qualifier in Fandom"

When using Desqriptor, it always writes the first choice (green) to the wikidata item

3
Fractec (talkcontribs)

Hey BrokenSegue,

when using Desqriptor, it always picks the first choice. It doesn't matter which one i chose as long as i choose one of the recommended ones.

See my edits on Q370740 and Q3577480.


No adblocker or other script blocking browser features used.

BrokenSegue (talkcontribs)

That is very concerning. Thank you for raising this to my attention. I will take a look this weekend and see if I can fix it / undo the damage

BrokenSegue (talkcontribs)
Reply to "When using Desqriptor, it always writes the first choice (green) to the wikidata item"