Commons:Bots/Requests

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search
This project page in other languages:

Shortcut: COM:BRFA

Gnome-system-run.svg

If you want to run a bot on Commons, you must get permission first. To do so, file a request following the instructions below.

Please read Commons:Bots before making a request for bot permission.

Requests made on this page are automatically transcluded in Commons:Requests and votes for wider comment.

Requests for permission to run a bot[edit]

Before making a bot request, please read the new version of the Commons:Bots page. Read Commons:Bots#Information on bots and make sure you have added the required details to the bot's page. A good example can be found here.

When complete, pages listed here should be archived to Commons:Bots/Archive.

Any user may comment on the merits of the request to run a bot. Please give reasons, as that makes it easier for the closing bureaucrat. Read Commons:Bots before commenting.

BebiezazaBot (talk · contribs)[edit]

Operator: Bebiezaza (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information) Can be contacted through Commons, Thai Wikisource, Thai Wikipedia, or English Wikipedia

Bot's tasks for which permission is being sought: Upload of documents from Royal Thai Government Gazette, which are works in public domain according to section 7 of the Copyright Act, BE 2537 (1994) of Thailand. (More information below)

Automatic or manually assisted: Automatic (semi-assisted through first uploads of its batch, to look out for any mistakes)

Edit type (e.g. Continuous, daily, one time run): One time run (several batches)

Maximum edit rate (e.g. edits per minute): 10 seconds between edits, on Wikimedia Commons and Thai Wikisource simultaeneously

When: The bot operates intermittently (in batches). Activity depends on the availability of file information provided to the bot, it can be started on demand by the local operator, on a stand alone pc

Bot flag requested: (Y/N): Y

Programming language(s): JavaScript (Node.js - mwn library)

The source code is available on GitHub.

Royal Thai Government Gazette is collection of legislations, regulations and notifications made by Royal Thai Cabinet Press. Its content, including text, tables, figures and emblems are deemed public domain according to section 7 of the Copyright Act, BE 2537 (1994) of Thailand. However, the use of some content (such as emblems) is strictly limited by several laws, including "Government Mark Act, 1939" (information from {{PD-TH-exempt}}'s documentation). Works already uploaded to Wikimedia Commons can be found at Category:Works published in the Royal Thai Government Gazette. Still, there are many more files to be uploaded for proofreading at Thai Wikisource. These are some example files which had been uploaded by users:

Bebiezaza (talk) 17:40, 9 January 2023 (UTC)Reply[reply]

Discussion

Question: Although I have wrote this request for batches, do I still need to request a new task for each upload batches of Royal Thai Government Gazette works? --Bebiezaza (talk) 17:40, 9 January 2023 (UTC)Reply[reply]

Please make test run. I think you'll need new request in case of significant changes to bot code that results in changes in file description/structured data. --EugeneZelenko (talk) 15:33, 11 January 2023 (UTC)Reply[reply]
First test run, 30 files uploaded. Experienced random 403 Forbidden on some files so these uploads were not continuous. I was also changing edit summary for earlier files. Considering another test run tomorrow. --Bebiezaza (talk) 18:01, 11 January 2023 (UTC)Reply[reply]
Another test run, 20 files uploaded. 403 Forbidden error has been accounted for in the code (immediate retry). Everything seems to be normal here now. --Bebiezaza (talk) 10:26, 12 January 2023 (UTC)Reply[reply]
I would suggest to create dedicated category instead of adding Category:2014 books PDF files, Category:2014 books from Thailand, Category:Thai government PDF files, Category:PDF files in Thai to each file.
Is it possible to use Wikidata items for Author and Publisher?
EugeneZelenko (talk) 15:35, 12 January 2023 (UTC)Reply[reply]
1. There is a dedicated category applied to every files in this batch named Category:Orders of the National Council for Peace and Order but the page has not been created as of yet. I find that it's a "tradition" to add 4 aforementioned categories to these types of files. Technically, files in this first batch are published in the span of 6 years so files toward the end of this batch will be placed into 2015/6/7/8/9 books ...... instead of 2014 categories.
2. For this batch, the Wikidata item for Author is d:Q16919521 (National Council for Peace and Order), and for Publisher is d:Q16305279 (Secretariat of the Cabinet (Thailand)). Though, I did not know where to place these Wikidata links on the page content.
Bebiezaza (talk) 17:02, 12 January 2023 (UTC)Reply[reply]
Author parameter of Template:Book support Wikidata item. Same still need to be done for Publisher. City is another reasonable candidate for that. --EugeneZelenko (talk) 16:03, 13 January 2023 (UTC)Reply[reply]
Like this? --Bebiezaza (talk) 17:13, 13 January 2023 (UTC)Reply[reply]
Yes. --EugeneZelenko (talk) 15:30, 14 January 2023 (UTC)Reply[reply]
Second test run, following files uploaded:
  1. File:คำสั่ง คสช ๒๕๕๗-๐๖๑.pdf
  2. File:คำสั่ง คสช ๒๕๕๗-๐๖๒.pdf
  3. File:คำสั่ง คสช ๒๕๕๗-๐๖๓.pdf
  4. File:คำสั่ง คสช ๒๕๕๗-๐๖๔.pdf
  5. File:คำสั่ง คสช ๒๕๕๗-๐๖๕.pdf
For files previously uploaded, I can go through and update them by myself --Bebiezaza (talk) 18:35, 14 January 2023 (UTC)Reply[reply]
Looks OK for me, but will be good idea to enhance Wikidata support in Template:Book. --EugeneZelenko (talk) 15:40, 15 January 2023 (UTC)Reply[reply]
Template:Book (Module:Artwork) is template-protected. And I don't think I'm qualified enough to edit the module by myself. --Bebiezaza (talk) 16:04, 15 January 2023 (UTC)Reply[reply]
I just checked some other uploads and found out that I can use Institution:Cabinet Secretariat of Thailand for Publisher field, will update to use it for the next run --Bebiezaza (talk) 05:57, 16 January 2023 (UTC)Reply[reply]
Did you leave request on this template talk page? At least maintainers should be aware about it. --EugeneZelenko (talk) 15:55, 16 January 2023 (UTC)Reply[reply]
@EugeneZelenko: What do you mean by that? Module:Artwork? --Bebiezaza (talk) 16:27, 16 January 2023 (UTC)Reply[reply]
Both, if changes should be made in both. --EugeneZelenko (talk) 16:30, 16 January 2023 (UTC)Reply[reply]
I have made the recommendation here. But in my opinion, this is becoming an off-topic discussion for a bot request. Is my bot good to go for receiving the bot flag? --Bebiezaza (talk) 12:04, 18 January 2023 (UTC)Reply[reply]
I will not be able to use computers next week (22-28 January) and may not be able to respond to any updates here until the following week. --Bebiezaza (talk) 04:30, 21 January 2023 (UTC)Reply[reply]

WLKBot (talk · contribs)[edit]

Operator: WLKBot (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information) User:Kim Bach is the operator, he can be contacted through commons mail, User:MSanderhoff can also be contacted.

Bot's tasks for which permission is being sought: Upload of public domain images of artwork and related metadata, using the Template:Artwork of artwork provided through partnerships with Danish GLAMS.

The first partnership is with SMK - The National Gallery of Denmark a project page is created for the purpose Commons:SMK - Statens Museum for Kunst

For the SMK contribution, the images are downloaded and metadata is mapped to the Artwork template, utilising the Statens Museum for Kunst, SMK API: entry KMS1 and the wikilabskultur Artwork template preprocessor.

Automatic or manually assisted: The bot is manually assisted and will most likely be running from a stand alone computer

Edit type (e.g. Continuous, daily, one time run): One time run (several batches)

When: The bot operates at specified intervals (batches). Activity depends on the availability of new batches from GLAMs, it can be started on demand by the local operator, on a stand alone pc

Maximum edit rate (e.g. edits per minute): Most likely 10-50 edits per minute, but only in short bursts, and it can be set to a much lower rate. The first proposed upload will be in the form of approx. 500 files of 10-30MB in size, time to finish is not of the essence, the total size of the contribution from SMK currently is approx. 70.000 images, the collection is expected to be quite static.

Bot flag requested: (Y/N): Y

Programming language(s): The bot is written in Python using the pywikibot library

The source code is available on GitHub


WLKBot (talk) 17:13, 16 December 2021 (UTC)Reply[reply]

Discussion
First test run, a total of 25 images uploaded. --WLKBot (talk) 09:44, 20 December 2021 (UTC)/User:Kim BachReply[reply]
Could medium be internationalized with template or Wikidata item? If not, please use language tag. Same for object type like Blyant. Could bot add painting by or more detailed category? Please also make batch categories hidden. --EugeneZelenko (talk) 16:04, 20 December 2021 (UTC)Reply[reply]
This file should have a default sortkey (Defaultsort) "Købke, Christen, 1838" as we generally sort items by the creator's last name.
It should have a category of "Paintings by Christen Købke in Statens Museum for Kunst" with a local sortkey of "|1838" (after the category name). I did that manually.
It should have a category of "Paintings by Christen Købke" with a local sortkey of "|1838" (after the category name). Is this not redundant? No, because those paintings will eventually be divided up as landscapes, marine paintings, portraits and so on.
It might have a category of "19th-century paintings in the Statens Museum for Kunst". Note: Here it is the Statens Museum for Kunst. Don't ask.
It might have a category of "Landscape paintings in the Statens Museum for Kunst".
It might have a category of "1838 paintings from Denmark".
Happy programming. Cheers Rsteen (talk) 12:54, 21 December 2021 (UTC)Reply[reply]
Thank you for your comment, and the detailed example, it's very helpful, I can definitely improve on the categorisation along the lines you suggested WLKBot (talk) 21:00, 22 December 2021 (UTC)Reply[reply]
@Kim Bach: please don't use your bot account to make manual edits like the previous comment.
I enjoyed visiting the SMK and looking forward to having all this content on Commons! Multichill (talk) 18:29, 1 January 2022 (UTC)Reply[reply]
@Kim Bach and MSanderhoff:  ? --Krd 04:26, 5 February 2022 (UTC)Reply[reply]
@Krd I assume that the "?" means that you'd like an update :-). We're busy implementing the changes suggested by the community, and we will not run another test until we're confident in the changes we're making, this should only be weeks away. Kim Bach (talk) 13:59, 6 February 2022 (UTC)Reply[reply]

Please report current status and intentions. --Krd 15:12, 28 November 2022 (UTC)Reply[reply]

@Krd
Current status is that we've addressed most of the suggestions by EugeneZelenko and Multichill and some of the suggestions by RSteen
This is a list of the changes we've made
  • medium has been wrapped in language template and will be using Wikidata items in test and production runs
  • object type has been wrapped in language template and will be using Wikidata items in in test and production runs
  • painting/... by categories added
  • batch categories made hidden
  • No longer adds Category:Images released under the CC0 1.0 Universal license by Statens Museum for Kunst
  • Wraps the medium field in {{Da}}, will move to Wikidata items in test and production runs
  • We'll be adding structured data using QuickStatements after we've uploaded the images, we will add the Wikidata-item to the Artwork template when we have created the Wikidata-items or identified already existing Wikidata-items
  • We've created two sections for the copyright and permissions
  • We're no longer using Creator template, we've switched to using Wikidata items, and will not upload images that has no Wikidata item for creators. The idea is to add the ones that are missing to Wikidata as we go along, using QuickStatements
  • We've added ...by... categories, but we think that most of the might have suggestions by Rsteen are better handled through structured data, which we'll be adding using QuickStatements
Below is an example of a file that was uploaded manually, where the Wikitext was generated by our updated code, and used to improve the code, we'll perform a few more of these before doing a test run, but we feel like we're ready
Giovanni Battista Piranesi, Det indre af Pantheon, 1768, KKSgb9860-86, Statens Museum for Kunst
The intention is to resume testing, firstly by doing some more manual uploads, in the context of Kim Bach, in preparation of new test runs.
Pending approval we'll move slowly to production runs, the plan is to do that one creator at a time.

--Kim Bach (talk) 05:32, 30 November 2022 (UTC)Reply[reply]

Please feel free to do a small test run. Krd 05:59, 30 November 2022 (UTC)Reply[reply]
@Krd We've made a test run of 20 new media files. We're of course referencing a number of new categories, we should probably create those as well. Maybe as we go along, maybe at a later point in time.
We're referencing a number of object types that aren't created yet. It looks like this is done in LUA code. I do have all the Q item numbers of the object types we're using on hand, but we're also planning on adding structure data using QuickStatemenst, we're currently testing that.
Regards --Kim Bach (talk) 10:05, 3 December 2022 (UTC)Reply[reply]
Since you (EugeneZelenko, Multichill and RSteen) commented on our first test run last year, I’d like you to know that we’re completed our second test run, and that we”ve tried to address most of your suggestions. —Kim Bach (talk) 09:17, 6 December 2022 (UTC)Reply[reply]
object type should be internationalized. Also excessive indentation for license tags and newlines between license tags and categories. --EugeneZelenko (talk) 16:10, 6 December 2022 (UTC)Reply[reply]
Had a quick look, nice images! My points
  • I see a lot of redundant white space. Please trim a bit more,
  • I'm not a huge fan of extra information fields. What's the point of adding "SMK record created", "SMK record modified" and the (broken) iiif link?
  • You're getting a warning because you are using {{PD-old}}. I guess most works are covered by {{PD-old-100-expired}}?
  • In the upload edit summary I would put something like "uploaded artwork from https://open.smk.dk/en/artwork/image/KKSgb22228" instead of "created artwork"
Multichill (talk) 18:09, 6 December 2022 (UTC)Reply[reply]
Thank you Multichill, I've incorporated your suggestions and I'll be omitting the "other fields" in the future, they were meant for internal bookkeeping (created and updated timestamps of metadata from SMK)
I've updated the wikitext for this image, trying to take your and Eugene's suggestions into account
Tobias Stimmer, Romerne indtager Satricum, 1574, KKSgb22345, Statens Museum for Kunst
Kim Bach (talk) 22:22, 6 December 2022 (UTC)Reply[reply]
Thank you EugeneZelenko, I can address your comments in this way:
Internationalisation of object type
I suppose you mean that I should use this:
  • object type={{en|Woodcut print}}
Excessive white space
I can beautify the wikitext along these guidelines:
  1. No leading whitespace
  2. No spaces between equal signs
  3. No double line feeds
For instance:
{{Artwork
|artist=<Artist Name>
|title=<The Title>
|description=<The Description>
...
}} Kim Bach (talk) 19:03, 6 December 2022 (UTC)Reply[reply]
On my opinion spaces after pipes make text more readable. --EugeneZelenko (talk) 15:39, 7 December 2022 (UTC)Reply[reply]
Agree, bit too much trimming see Template:Artwork#Usage for a good example how it should look. Multichill (talk) 18:18, 7 December 2022 (UTC)Reply[reply]
Thanks, I’ll go for that, is at ends with the suggestion by Eugene (space after pipe, I guess that’s not to importerne.
Another possible issue is the new lines. They flush left, and it doesn”t look good. Is there a way to add indentation that doesn’t show up in the rendered wiki page, I suppose the List templates could be used for that purpose. Kim Bach (talk) 14:40, 8 December 2022 (UTC)Reply[reply]
I've made changes that tries to accommodate your standard usage suggestion - this is a "bit" confusing because the Usage sample and the Multilingual sample differs :-/ - and the suggestion with spaces trailing pipes by EugeneZelenko. I've updated the wikitext for the Tobias Stimmer image to reflect the suggested changes :-) Kim Bach (talk) 11:09, 9 December 2022 (UTC)Reply[reply]
Title consists from two tags. Is it possible to merge title in two sentences in one tag? Also there is d:Q18219090, so will be good idea to use it instead of text. --EugeneZelenko (talk) 15:54, 9 December 2022 (UTC)Reply[reply]
Multiple titles
The reason we have two titles for this particular item, is that the museum DB can have several titles for an item.
The best thing to do, might be to limit it to one - since one of the titles is considered official by the museum. We could use that, and then, at a later point in time, add more titles to wikidata, we could also change it to one line, like you suggested.
Using wikidata
Wikidata, works fine for the medium tag wrapped in the Technique template, we can do that, since we've already mapped the Q-numbers.
Is it possible to use Q-numbers for the object type too?
We've tried it, with no luck in nailing the syntax. Kim Bach (talk) 22:24, 9 December 2022 (UTC)Reply[reply]
Looks like Module:Artwork should be enhanced. --EugeneZelenko (talk) 15:17, 10 December 2022 (UTC)Reply[reply]
Hi. Take a look at this upload File:Elisabeth Jerichau Baumann, En såret dansk kriger, 1865, KMS852, Statens Museum for Kunst.jpg. The dimensions are not ok. They are in mm instead of cm. Do not know if this is a general error, and have not seen any comments on it before. Cheers Rsteen (talk) 10:35, 20 December 2022 (UTC)Reply[reply]
Thank you, yes, that was a general error in the code from last year, everything was a factor 10 off, as you noticed, we've fixed that since. I forgot to fix this manually in the batch from last year, so I'll do that. Kim Bach (talk) 15:57, 21 December 2022 (UTC)Reply[reply]
An updated status. We're currently refactoring the category generating code, this will most likely not be ready until january 2023. We still welcome comments to the last test run. Kim Bach (talk) 16:10, 21 December 2022 (UTC)Reply[reply]
@Krd
We've refactored the code, and are now ready to perform a new test. The main change is that we now try to create new category pages, if they don't exist Kim Bach (talk) 21:31, 11 January 2023 (UTC)Reply[reply]
@Multichill and EugeneZelenko we're ready to perform a new test run :-) --Kim Bach (talk) 14:31, 13 January 2023 (UTC)Reply[reply]
Just to clarify: was Module:Artwork improved meanwhile? --EugeneZelenko (talk) 15:59, 13 January 2023 (UTC)Reply[reply]
No, it looks like the procedure involves requesting an edit, I wasn't aware of that, I've added a comment on the talk page. Kim Bach (talk) 20:31, 13 January 2023 (UTC)Reply[reply]