Commons:Batch uploading

COM:BATCH


Request a batch upload	Current requests	Past batch uploads	Failed batch uploads

Commons Batch Uploading is a project to centralize the uploading of a collection of files, that have released their work as PD or any Commons compatible license. The files would be assigned to a bot operator who would see how the request would be fulfilled.

Before you request a batch upload here, please read the guide to batch uploading first.

See Commons:Free media resources for further potential batch uploads.

Related project: Commons:Library back up project aims to upload books in public domain from libraries of all languages.

Requests

British Library Flickr

Source to upload from

https://www.flickr.com/photos/britishlibrary/

License

"No known copyright restrictions" on Flickr, with a further Public Domain declaration made on social media (e.g. [1], "A semi-regular reminder that all our photos on Flickr Commons are public domain - you don't need to ask us to use them in any way you can imagine!").

Description

Standard Flickr pages, but Flickr2Commons apparently does not allow works with this licence to be uploaded.

Many historic and OOC documents, but also some original photos (e.g. Living with Machines exhibition). Some may already be on Commons. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 12:26, 29 September 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

IZW-Medienarchiv

Images (mostly historical photographs) from the de:Wasserstraßen- und Schifffahrtsverwaltung des Bundes (the German federal waterway authority), the Internationale Mosel-Gesellschaft (International Moselle Company), and the de:Rhein-Museum Koblenz (Museum of the River Rhine)

Source to upload from

https://izw-medienarchiv.baw.de/

License

Most (all?) of them under a CC-BY-4.0 license

Description

Already discussed at Commons:Bots/Work requests, User:DaxServer is compiling a list.

At https://izw-medienarchiv.baw.de/ there are 28.965 (their number) assets (mostly historical photographs) from the de:Wasserstraßen- und Schifffahrtsverwaltung des Bundes (the German federal waterway authority), the Internationale Mosel-Gesellschaft (International Moselle Company), and the de:Rhein-Museum Koblenz (Museum of the River Rhine), most (all?) of them under a CC-BY-4.0 license.

Several hundred files are already at Category:Kollektionen der Bundesanstalt für Wasserbau. I think all of them would be very useful for articles and information on the various waterways, structures and cities along the rivers etc. The files are organized using de:easydb, so there should be an API (RESTful API over HTML).

Some files, like historical advertisements from shipbuilding companies, might have to be excluded because it's not quite plausible how the museum would be authorized to put these under CC licenses. Rosenzweig τ 13:48, 16 September 2025 (UTC)[reply]

Opinions

I want to ping OhneEisen for coordination. @OhneEisen, you seem to do some contrast work (example: File:Binger Loch HB09115.webp) - are you still working on these new uploads? -- DaxServer (talk) 14:06, 16 September 2025 (UTC)[reply]
@ DaxServer: Yes, I collect everything about transportation and traffic in the Rhine area. If I've got it right, there are currently 198 files from the IZW archive https://commons.wikimedia.org/w/index.php?title=Special%3AMediaSearch&fulltext=Search&search=baw+incategory%3A%22Files+uploaded+by+ohneEisen%22&type=image

(At abload.de there were once over 1000.)

At ISW many images are too dark, should be rotated ,etc. But I would be happy if a bot would take over the upload. Then you could concentrate on the pictures that really need it.

At BAW, everything I saw is under CC BY 4.0, but not at RMK. Images with "Alle Bildrechte bei Rolf Diesler" are partly under CC BY 4.0, but partly not. OhneEisen (talk) 15:56, 16 September 2025 (UTC)[reply]
Thanks @OhneEisen. A bot can take over but I don't think I'd be able to do any contrast work. -- DaxServer (talk) 18:15, 17 September 2025 (UTC)[reply]
Thank You! So i will stop uploading images from IZW by hand and wait for the bot... OhneEisen (talk) 07:08, 18 September 2025 (UTC)[reply]

I have another question about contrast and image editing in general: I edited some of the uploads from the LABW and first treated them as derivatives, i.e. uploaded them as a new file and linked the version: https://commons.wikimedia.org/wiki/File:K%C3%B6ln_Hbf,_Baureihe_23_-_LABW_-_Staatsarchiv_Sigmaringen_Dep._44_T_2_Nr._495_A_17.jpeg

But if the editing is just brightening, I upload as a "new version" over it https://commons.wikimedia.org/wiki/File:Istein-_Planierungsmaschine_am_Nordeingang_(des_Klotz-Tunnels)_-_LABW_-_Staatsarchiv_Freiburg_W_134_Nr._015337c.jpeg

Where is the boundary between version and derivative? OhneEisen (talk) 08:36, 18 September 2025 (UTC)[reply]
@OhneEisen I'd recommend asking that question in Commons:Village pump and see what the community thinks ;) -- DaxServer (talk) 18:31, 18 September 2025 (UTC)[reply]

@ DaxServer: Another big field is the Landesarchiv NRW. Many images there are under CC BY SA 4.0.

Much of what I upload by hand comes from there, especially in the last few days. https://commons.wikimedia.org/w/index.php?title=Special:ListFiles/OhneEisen&ilshowall=1

Here, too, I would be happy if a bot could upload and only the finishing touches had to be done by hand.

I have often uploaded several versions https://commons.wikimedia.org/wiki/File:K%C3%B6ln_Heumarkt_!931_R_RW_0261_00101_R.webp OhneEisen (talk) 16:12, 16 September 2025 (UTC)[reply]
There's a similar project on-going for Landesarchiv BW at Commons:Batch uploading/Landesarchiv Baden-Württemberg. If you'd be able to provide information and support, I'd recommend creating a new Batch upload request for the NRW -- DaxServer (talk) 18:15, 17 September 2025 (UTC)[reply]
@ DaxServer: I've been following the work of the CuratorBot at the LABW since June and I'm thrilled! (I just find the file names a bit long sometimes...) I saw earlier that the CuratorBot was at work again yesterday, with postcards from Sigmaringen, among other things. When will the project be completed? It seems to me that there are still too few participants linking all the material into suitable categories, but that will surely come with time.

Unfortunately, I am not very technically adept and can only provide information on the content of the collections at LA NRW.

Participant P170 created a template in February 2025 that I have been using ever since.

https://commons.wikimedia.org/wiki/Template:LAVNRW

Just this much in advance: digitization at the LA NRW is not as far advanced as at the LABW. The focus so far has been on orthophotos and oblique aerial photographs (these only up to 1945) and larger sections of old maps.

If there is a project like the LABW, I can of course name all the finding aids that I know of so far. OhneEisen (talk) 07:33, 18 September 2025 (UTC)[reply]
I've replied on the LABW project page -- DaxServer (talk) 14:03, 19 September 2025 (UTC)[reply]

Here are the details I extracted: (Example at https://izw-medienarchiv.baw.de/#/detail/c2d24ad3-06ff-44bc-9e93-e2eb2b1d1b90 unless mentioned otherwise)

Key	Example value	Proposed value/usage
abgebildete Wasserstraßen	Gliederung der Bundeswasserstraßen nach VV WSV 1103 (Verwaltungsvorschrift der Wasserstraßen- und Schifffahrtsverwaltung des Bundes 1103). Example: "3901 - Rhein, Hauptstrecke"
Archivnummer	"RMK01007"	accession number
Aufnahmedatum	Dates (seem to be just years?) are extractable. "2012 - 2020"	date
Ausrichtung	3 values: Hochformat / Quadrat / Querformat
Bauphase	Klassifiziert die Bauphase des abgelichteten Objekts. Can have multiple values
BAW-Fachabteilungen	Beteiligte Fachabteilungen der Bundesanstalt für Wasserbau. 4 values: Geotechnik / Wasserbau Binnen / Wasserbau Küste / Zentraler Service / no-value	possibly department
Beschreibung	"Das Taucherglockenschiff CARL STRAAT liegt im Bereich der Koblenzer Brauerei vor Anker. Der Senkkasten kann über einen Heckgalgen bis zu 10 m auf die Flusssohle abgelassen werden."	description
Bilddarstellungen	Repräsentation/Machart des Bildes. Can have multiple values
Bildgrößenklasse [MegaPixel]	Gruppierte Länge x Breite des Bildes. Nutzbar um ähnlich große Bilder zu finden. 12 values. Example: "5 - 10"
Copyright (Anmerkungen)	23 entries have copyright restrictions set	license tag (outside photography template)
Ersteller	798 values. Example: "Günther Lamek; Rhein-Museum Koblenz"	photographer
Farbe	2 values: 'Farbe' / 'Schwarz/Weiß' (except 1, 2, 3)	`{{technique}}` in medium
Gewässerart	Can have multiple values
global object id	2938419@ec3f94f8-c6bd-4a87-806d-b70f18ab49ec seems to be internal ID
Kategorien	Can have multiple values
Latitude		camera coord
Longitude		camera coord
Organisation	2 values: Bundesanstalt für Wasserbau / Wasserstraßen- und Schifffahrtsverwaltung des Bundes ([2])	institution
Ort (Gemeinde)	Gemeinde, in der oder in deren Nähe das Bild aufgenommen wurde bzw. das abgelichtete Objekt steht. 1944 values. Example: "Koblenz"	depicted place
Quelle	5 values: Bundesanstalt für Wasserbau / Bundesanstalt für Wasserbau Karlsruhe / Bundesverband der Deutschen Binnenschifffahrt e.V. / Internationale Moselkommission / Rhein-Museum Koblenz	possibly source
Schlagwörter	Can have multiple values
Sprache	2 values: Deutsch / Keine Sprache
Strecke von [km]	Flusskilometer auf Berechnungsgrundlage der VV WSV 1103 (Verwaltungsvorschrift der Wasserstraßen- und Schifffahrtsverwaltung des Bundes 1103).
Strecke bis [km]	Flusskilometer auf Berechnungsgrundlage der VV WSV 1103 (Verwaltungsvorschrift der Wasserstraßen- und Schifffahrtsverwaltung des Bundes 1103).
Titel	Taucherglockenschiff CARL STRAAAT	title
uuid	c2d24ad3-06ff-44bc-9e93-e2eb2b1d1b90 from URL https://izw-medienarchiv.baw.de/#/detail/c2d24ad3-06ff-44bc-9e93-e2eb2b1d1b90
Zeitraum	Can have multiple values ([3] vs [4]
Öffentlicher Pool	3 values: Historisches Bildarchiv der Bundeswasserstraßen / Historisches Bildarchiv Internationale Mosel-Gesellschaft mbH & Moselkommission / Rhein-Museum Koblenz	possibly source

@Rosenzweig Could you inspect and provide a mapping of the fields that would go into the {{Photograph}} template along with any SDC that is not already covered by other bots? 28.540 JPEGs and 448 TIFFs are listed with 28.988 in total. Let me know if you want to have more details, or just an export of the OpenRefine project that you can inspect yourself. -- DaxServer (talk) 14:48, 19 September 2025 (UTC)[reply]

@DaxServer: Understood. Let me think about it for a while. --Rosenzweig τ 15:34, 20 September 2025 (UTC)[reply]

@DaxServer: I have provided some mappings above, though I'm not sure about all of them. I was testing it on a file I had already uploaded manually, File:Heilbronn Kanalhafen Luftbild 1978 HB02272.jpg. I noticed that there is additional information which I don't see above, like Farbe (which I put into the medium field with {{Technique|black and white photography}}) and Sprache.

For some of the information present, I could not find a proper mapping. Like Ausrichtung (orientation), which would be portrait, square and landscape, but I could not find that in any relevant template or in the SDC properties table (only in some valued image templates).

Bilddarstellungen has values like aerial photograph or portrait of a person, that could go into categories. Likewise, Ort (Gemeinde) could also serve as the basis for some rough initial categorisation (+ "check coordinate" template). abgebildete Wasserstraßen, Strecke von [km], Strecke bis [km], Gewässerart, Kategorien, and Schlagwörter could go into categories too, but it would probably be too much work to map those. Maybe put them in some text-only field like notes? Or as part of the description?

Not sure about global object id and uuid. uuid could serve as part of the source link it seems?

Not sure if the coordinates are camera coordinates or object coordinates. Probably object coordinates, so the template for them should be used.

Quelle and Öffentlicher Pool could go into the source field along with an URL.

I don't think we can use Bauphase, Zeitraum, and Bildgrößenklasse.

Can you provide a spreadsheet file with the extracted information for all of the files? In a format for something like Excel, LibreOffice Calc, Gnumeric etc.? Maybe there are some additional values there which I did not see yet. Regards --Rosenzweig τ 14:27, 26 September 2025 (UTC)[reply]

@Rosenzweig: I've added the Farbe and Sprache to the table above. Here's the export https://files.daxserver.com/files/izw.zip of the data I've in OpenRefine in Excel format along with a sample JSON of the entire data for one object that their API responds with. Additionally, do you also want the Baujahr under the 'abgebildete Objekte' field (example)? -- DaxServer (talk) 09:41, 27 September 2025 (UTC)[reply]

Status

Assigned to	Progress	Bot name	Category
DaxServer (talk · contribs)	Sorting metadata

National Agriculture Imagery Program imagery

Source to upload from

https://coast.noaa.gov/dataviewer/#/imagery (some pages in question: Alabama, Arizona, California, Florida, Georgia, Idaho, Illinois, Indiana, Iowa, Kentucky, Maine, Kansas, Louisiana, Michigan, Minnesota, Mississippi, Missouri, Montana, Oklahoma, Nebraska, New Mexiko, Nevada, North Carolina, North Dakota, Ohio, Pennsylvania, Oregon, South Carolina, South Dakota, Tennessee, Texas (2020), Texas (2018), Utah, Virginia, Washington (state), Wisconsin, West Virginia, Wyoming, )

License

{{PD-USGov-USDA}} (see also: https://catalog.data.gov/dataset/national-agriculture-imagery-program-naip-imagery)

Description

The USDA NAIP imagery is a collection of aerial photographs of the United States that are useful for illustrations of counties, towns, and more. Due to the recent actions of the 2025 Trump administration, the permanent access of the imagery is not guaranteed. These are TIF images, which provide more information like coordinates for GIS systems and simply for viewing. Tools like OpenRefine make uploading easier. Each page should provide a TXT file with all the links of the files.

Data amount
US state	Approx. size (TiB)
Alabama	5.7
Arizona	3.03
California	4.3
Florida	1.67
Georgia	1.61
Idaho	2.26
Illinois	1.53
Indiana	0.994
Iowa	1.53
Kentucky	1.14
Maine	0.952
Kansas	2.18
Louisiana	5.47
Michigan	1.71
Minnesota	2.28
Mississippi	5.29
Missouri	1.92
Montana	3.9
Oklahoma	1.9
Nebraska	2.05
New Mexico	3.23
Nevada	2.73
North Carolina	1.43
North Dakota	1.89
Ohio	0.41
Pennsylvania	1.21
Oregon	2.6
South Carolina	1.74
South Dakota	2.08
Tennessee	1.16
Texas	7.2 (2022); 7.19 (2020)
Utah	4.48
Virginia	1.15
Washingotn (state)	1.87
Wisconsin	1.56
West Virginia	0.7
Wyoming	2.57
Total	ca. 89.43 TiB or ca. 98.3 TB
Beispiel	Beispiel
Beispiel	Beispiel

PantheraLeo1359531 😺 (talk) 10:10, 27 August 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

RGBI-Orthophotos von Niedersachsen

Source to upload from

https://ni-lgln-opengeodata.hub.arcgis.com/apps/lgln-opengeodata::digitales-orthophoto-dop/about (overview)
https://arcgis-geojson.s3.eu-de.cloud-object-storage.appdomain.cloud/dop20/lgln-opengeodata-dop20.geojson (file with links to the resources)

License

CC BY-4.0 (see here (paragraph 7))

Description

The webpage offers RGBI orthophotos, which are the high quality orthophotos compared to the lossy compressed RGB orthophotos. Anyway, there are a lot RGBI orthophotos, so it would by necessary to upload them via a script. The GeoJSON-File contains the links to the RGBI orthophotos. I am covering the RGB orthophotos right now. The images can be used for Wikidata objects of hamlets, villages, towns, etc. Thanks!

PantheraLeo1359531 😺 (talk) 18:27, 22 August 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Maproom.org

Images of maps from atlases, mostly published in the 19th century: see Commons:Bots/Work requests#Donating map images.

Source to upload from

https://maproom.org/

License

I assume these are public domain, so {{PD-old-70-expired}} can be used.

Description

See the bot work request: the website owner has got a database with information for each image. Wikiwerner (talk) 11:11, 22 June 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Aerial photos by Geospatial Information Authority of Japan

Source to upload from

https://service.gsi.go.jp/map-photos/app/map?search=photo

License

{{GJSTU-2.0|terms=https://www.gsi.go.jp/kikakuchousei/kikakuchousei40182.html|attr=国土地理院 (GSI)}}

Description

Describe the content to be uploaded in detail (audio files, images by …), and what makes it valuable to Wikimedia Commons. You can also add any other information that could be useful, like:

Do the media URLs follow a pattern?
yes. e.g.https://service.gsi.go.jp/map-photos/app/map?&id=114514
Does the site have an API?
I don't know.
What else could ease uploading?
I don't know.
Did you contact the site owner?
no.
Is there a template that could be used on the file description pages, or should one be created?
{{Map}}を使用する。Category:Images from the Maps and Geospatial Informationを画像に付与する。ファイル名は「File:GSI_整理番号-コース番号-写真番号_撮影年月日.jpg」の形式に従う。

特急いよのたみ (talk) 11:33, 20 June 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Images from the Swedish National Heritage Board (TIFF)

TIFF versions of images which already exist on Commons as JPEGs.

Source to upload from

https://app.raa.se/open/arkivsok/search

License

Various Creative Commons licenses or public domain.

Description

We already have tens of thousands of images from the Swedish National Heritage Board (see Category:Images from the Swedish National Heritage Board). However, many of our current versions are lossy JPEGs of low resolution. Rather than overwriting these, it would be preferable to instead upload the lossless TIFF originals.

Take for instance File:Sundsvall - KMB - 16000300030376.jpg. For every current JPEG file, an identically named TIFF replacement could be uploaded instead, such as "File:Sundsvall - KMB - 16000300030376.tif". File descriptions and categories could simply be copied over from the current JPEG versions, which would then be marked with {{Superseded}}.

Using the above file as an example, its source page looks like this. Clicking on the download arrow lists the alternative “Arkivformat (tiff)”. However (and while I am not very technically inclined), there also seems to be some sort of API. The bottom of the page gives a URI, in this case [5]. Following the linked URI leads to a JSON sheet, containing another URI ending in “arkivbestandig/1”, which can be used to directly access the TIFF version.

VulpesVulpes42 (talk) 12:25, 16 June 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Legislation.gov.uk Acts of Parliament

Acts of Parliament from 1975 to 1990 in the form of booklets.
Acts of Parliament since then (i.e. from 1991).

Source to upload from

https://www.legislation.gov.uk/

License

As they are hosted on legislation.gov.uk, they are licensed under {{OGL3}}.

Description

Around early April 2025, National Archives (which operates legislation.gov.uk) replaced Stationery Office booklet scans of British Acts to scans from the annual Public General Acts collections ("PGA collections") by overwriting their hosted files, which affected Acts until 1990. This is identifiable by the lack of table of contents and the page numbers. For example, one may refer to the current version of Theft Act 1968 on Commons with the one now hosted on legislation.gov.uk.

Fæ uploaded the available scans of Acts to here up to 1975, but the rest are wiped out by the National Archives. Fortunately many of the requested scans are archived on the Wayback Machine, and the original links follow the format of https://www.legislation.gov.uk/ukpga/YYYY/CC/pdfs/ukpga_YYYY00CC_en.pdf, where:

YYYY is the year of enactment; and
CC is the chapter number, and should be filled with a 0 if the chapter number is smaller than 10 (e.g. chapter 1 means 01).

And as a secondary request, I also request that post-1990 Acts can also be batch uploaded to Commons at the same time. They are unaffected by National Archives' move, so I believe bot uploads for them would be easier to deal with.

To preserve the aforementioned files, I request this batch upload process. Please feel free to comment if there are any other information needed, thank you.廣九直通車 (talk) 13:07, 1 May 2025 (UTC)[reply]

The title of each file on commons should be of the form "ActName (UKPGA YYYY-chapnum qp)". For acts passed during the reign of Charles III replace qp with kp. ToxicPea (talk) 01:26, 3 May 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Portable Antiquities Scheme

All PAS images should be uploaded, on a recurring basis.

Source to upload from

Example (for File:Iron Age Coin, Stater of the Corieltauvi (FindID 622777).jpg):

Source: https://finds.org.uk/database/ajax/download/id/473330
Catalog: https://finds.org.uk/database/images/image/id/473330/recordtype/artefacts archive copy
Artefact: https://finds.org.uk/database/artefacts/record/id/622777

License

Various Commons-compatible CC licences

Description

User:Fæ was making these uploads on a regular basis (as of January 2021, there were over 614,980 files; I assisted with subsequent categorisation) before they left the project. AFAICT, no work has been one since then. The images are highly valuable from an historic and educational PoV.

See User:Fæ/Project list/PAS for detailed documentation.

Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 18:17, 4 September 2023 (UTC)[reply]

Opinions

I've requested a property for the image ID to track it better: d:Wikidata:Property proposal/Portable Antiquities Scheme image ID -- DaxServer (talk) 13:02, 23 April 2025 (UTC)[reply]

Created as Portable Antiquities Scheme image ID (P13556). Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 18:03, 6 May 2025 (UTC)[reply]

I'll file a bot request to fill the property for the ~600,000+ files in a couple of days. Once that is done, let's review the batch upload -- DaxServer (talk) 19:19, 6 May 2025 (UTC)[reply]

@Pigsonthewing I've filed a bot request at Commons:Bots/Requests/CuratorBot (6) for the SDC to begin with. Also, there are some corrections to be done (see that page for the info). If you some ideas on how we can fix the errors, let me know -- DaxServer (talk) 15:54, 9 May 2025 (UTC)[reply]

Thank you. No idea on the errors. How odd. How are you finding them? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 16:02, 9 May 2025 (UTC)[reply]

@Pigsonthewing The requests to the website from Toolforge are blocked by Cloudflare and thus I don't think I can run this task on Toolforge. I will try to explore alternatives. Bummer :( -- DaxServer (talk) 08:17, 17 May 2025 (UTC)[reply]

How bothersome. Are they blocked completely, or rate limited?

I believe Fæ used to operate a dedicated server rather than using ToolForge. I have no idea if that's why. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 11:59, 17 May 2025 (UTC)[reply]

They're completely blocked. Cloudflare returns a challenge and thus we cannot do much. I can run it on my server. I'll try to do that in a few days -- DaxServer (talk) 12:19, 17 May 2025 (UTC)[reply]

My server doesn't make a difference as well. It seems I can only run it on my system. Let's see how it goes. -- DaxServer (talk) 15:32, 18 May 2025 (UTC)[reply]

Assigned to	Progress	Bot name	Category
DaxServer (talk · contribs)	Backfilling SDC	CuratorBot (talk · contribs)

Śląskie Digitarium (Silesian Digitarium)

The purpose of the Silesian Digitarium project is the digitisation and accessibility of the cultural resources of the Silesian province through a comprehensive system of digitisation, archiving and presentation of collections.

Source to upload from

All images to be uploaded will be added to Wikimedia Poland's Google Drive

License

Creative Commons: Attribution 4.0 ( https://creativecommons.org/licenses/by/4.0/ )

The Wojciech Korfanty Institute is the owner of the property copyrights to the submitted works under Article 12 of the Law on Copyright and Related Rights, as the authors performed these works as part of their official duties while being employees of the Institution.

File sharing consent was sent to: permissions-pl@wikimedia.org, and its number will be added to the template when uploading.

I confirm that above mentioned consent with the list of pictures has been sent to VTR: ticket:2025031410006899. Polimerek (talk) 13:48, 14 March 2025 (UTC)[reply]

Description

Uploading of photos taken as part of the “Silesian Digitarium” project is made as part of the cooperation between Wikimedia Poland and the Wojciech Korfanty Regional Institute of Culture.

I will be adding all photos to Wikimedia Commons using Open Refine and the official Wikimedia Poland account: User:Ada Jakubowska (WMPL). I will do this in several rounds, with the first round being a trial, with a small number of photos (max 40). The planned number of photos to be uploaded is 4 753, and each will include templates such as: - Template:Wojciech Korfanty Institute partnership (Mobile Digitization Centre) - License template with upload consent number Each photo will have a category: Category:Media contributed by the Mobile Digitization Centre and, if possible, a category related to the ritual shown in the photo The source in the description of each file will be: Wojciech Korfanty Regional Institute of Culture (Q28672957)

Names of images in the Commons will be named according to the formula: [event name]_[file number]_RIKiWK_MCD

Ada Jakubowska (WMPL) (talk) 13:39, 14 March 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Museovirasto collection

The Finnish Heritage Agency (Museovirasto) has approximately 386,935 CC-BY 4.0 licensed images available through the Finna.fi platform that could be valuable additions to Wikimedia Commons. These images document Finnish cultural heritage, including historical photographs, artifacts, and architectural documentation.

Source to upload from

Main collection: https://museovirasto.finna.fi/
API access: https://api.finna.fi/v1/

License

386,935 images are licensed under CC-BY 4.0.

Description

Each image comes with extensive metadata through the Finna API including:

Original titles in Finnish and sometimes Swedish
Descriptions and historical context
Dates and time periods
Creator/photographer information
Geographic locations
Subject keywords and classifications
Collection and archival references
Technical image information

The images will be categorized using: 1. Primary tracking categories

Media contributed by the Finnish Heritage Agency
Collections of the Finnish Heritage Agency

2. Content-based categories derived from Finna metadata:

Location categories (from "Aiheen paikka")
Time period categories (from "Aiheen aika")
Subject matter categories (from "Aiheet")
Photographer categories (from creator fields)
Media type categories (from "Aineistotyyppi")
Cultural heritage categories

Technical details:

The Finna API (api.finna.fi) provides full programmatic access
High-resolution images are available
Structured metadata enables automated categorization
API includes pagination and filtering capabilities

Template usage:

Information template for core metadata
Institution template for Museovirasto
Creator templates for photographers
Multilingual descriptions (Finnish/English where available)

The upload will be conducted in phases: 1. Initial test batch (100 images) 2. Themed batches by subject matter 3. Systematic upload of remaining collection

--Apinanaivot (talk) 18:41, 18 January 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category
	Planning

Videos by Psych2Go

The videos of the Channel "Psych2Go"

Source to upload from

https://www.youtube.com/@Psych2Go

License

Description

The videos addresses different psychological-related topics; often with references. The topics are based on psych health, love and social relationships, and more.

PantheraLeo1359531 😺 (talk) 12:17, 13 November 2024 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category
			Category:Videos by Psych2Go

George Michell Bengal Photographs

Source to upload from

All digitised photographs have been uploaded to this Google Drive

License

Creative Commons Attribution-Share Alike 4.0 International

George Michell as the sole owner of the physical photographic slides has released the copyright of that work under the above license.

Detailed discussion with the Wikimedia VRT agent and link to the ticket is available in this section of the Discussion tab of the Wikimedia Grant page.

Description

1. All the metadata including file names and locations is in this Google Sheet.

2. Photos are within folders named after sites photographs were taken. The metadata file has references to locations.

Do the media URLs follow a pattern: Yes, metadata file has details.
Does the site have an API: Not sure. It is a Google Drive folder.
What else could ease uploading: Metadata file.
Did you contact the site owner: Photos are uploaded to my personal Google Drive.
Is there a template that could be used on the file description pages, or should one be created: In the metadata file.

AmitGuha (talk) 18:02, 7 June 2024 (UTC)[reply]

Hello @AmitGuha. Rows 469 and 470 has no Creator. Should it be assumed to be George Michell or be left blank? -- DaxServer (talk) 09:48, 20 September 2024 (UTC)[reply]

Are the GPS coordinates: of the building or of the photographer? -- DaxServer (talk) 10:31, 20 September 2024 (UTC)[reply]

Would you be able to provide some Wikimedia Commons categories for the files?

By default, they'd added to Category:George Michell and Category:Centre for Studies in Social Sciences -- DaxServer (talk) 10:42, 20 September 2024 (UTC)[reply]

Is Commons:Batch uploading/George Michell Bengal Temples request a duplicate? -- DaxServer (talk) 11:08, 20 September 2024 (UTC)[reply]

There are quite some discrepancies in the Sheets compared to the Drive. Please refine them, it's becoming harder for me to validate -- DaxServer (talk) 14:40, 20 September 2024 (UTC)[reply]

Many thanks @DaxServer for the comments above.

1. I will fix all of the missing creator and discrepancies on sheets vs drive and respond here.

2. Yes, I will provide additional Wikimedia Commons categories

I have found some other errors as well. I will need a couple of weeks to fix this and will respond and tag you here when I'm done.

On the other questions:

1. The GPS coordinates are of the buildings

2. Commons:Batch uploading/George Michell Bengal Temples is a duplicate and can be removed

Thanks again! AmitGuha (talk) 21:51, 23 September 2024 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Yevgeny Khaldei photographs

Photographs in public domain by famous Soviet photographer Yevgeny Khaldei.

Source to upload from

https://russiainphoto.ru/search/years-1937-1953/?query=&author_ids=171

License

Public domain - {{PD-Russia}}{{Yevgeny Khaldei as TASS photographer}} (2,453 files)

Description

Do the media URLs follow a pattern?
- Description page: for example https://russiainphoto.ru/photos/180701/, but not all files are in order in the catalog numbering system
- Description pages contain plenty of metadata such as title, caption, source museum, date taken, place taken and keywords, which should be included
- Full size images: for example https://735606.selcdn.ru/thumbnails/photos/2017/09/04/ztjzboam0rskxuxz_1024.jpg, however not every file was uploaded on the same day and some files have urls like https://735606.selcdn.ru/thumbnails/photos/9/6/o/96o52b30d244f515_1024.jpg
Does the site have an API? - Yes
What could ease uploading? The source code for every image page brought up by search query [6] includes source code with this text.

<div class="share share_size_small b-photo__share-block" data-description="" data-title="Подвеска авиабомб в самолет Пе-2, 1941 год" data-url="https://russiainphoto.ru/photos/180589/" data-image="https://735606.selcdn.ru/thumbnails/photos/2017/09/04/qpip0gtkuggujqqo_1024.jpg"></div>

- Information from the field data-title should be the image title
- data-url is the description url
- data-image is the link to the full size image
Did you contact the site owner? - Not necessary
Is there a template that could be used on the file description pages, or should one be created? - Not necessary

Kges1901 (talk) 17:37, 8 August 2024 (UTC)[reply]

I uploaded the photos from before 1946 because Template:PD-Russia-1996 says "created by an employee of TASS, ROSTA, or KarelfinTAG as part of that person’s official duties between July 10, 1925 and January 1, 1946"

999

REAL 💬 ⬆ 09:54, 10 April 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Deepin icons

Deepin's icons.

Source to upload from

https://github.com/linuxdeepin/deepin-icon-theme

License

This work is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 3 of the License, or any later version. This work is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See version 3 of the GNU General Public License for more details.

Description

Do the media URLs follow a pattern?

Sure do. Example: https://github.com/linuxdeepin/deepin-icon-theme/blob/master/Sea/apps/scalable/accessories-text-editor.svg

Does the site have an API?

Yes.

What else could ease uploading?

Not sure.

Did you contact the site owner?

Nope.

Is there a template that could be used on the file description pages, or should one be created?

User:Psiĥedelisto/Deepin icons

Psiĥedelisto (talk • contribs) ^{please always ping!} 18:51, 3 July 2023 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category
	Half done		Category:Deepin Icon Theme

@Psiĥedelisto: Hi! I looked deeper into this and uploaded the majority of the icons. Unfortunately, some icons are covered by copyright, as they are derivative work. --PantheraLeo1359531 😺 (talk) 19:17, 20 January 2024 (UTC)[reply]

IBM Research on YouTube

Source to upload from

https://www.youtube.com/@ibmresearch/videos

License

Virtually all uploads to the IBM Research channel are licensed under the Creative Commons Attribution 3.0 Unported license, per the License tag in the description of each video.

Description

784 videos (and counting, as of the time I'm writing this) of pure IBM and technology-related gold. Lots of great photography and headshots to extract from these. Some of the content therein may contain non-free elements over the de minimis threshold, but from what I've watched so far those are few and far in between. Would be trivial to download all videos using youtube-dl; reencoding each video to fit within the 100 MB upload limit is a different story however. DigitalIceAge (talk) 04:52, 17 November 2023 (UTC)[reply]

The limit is now 5 GB. Yann (talk) 20:12, 17 April 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category
			Videos by IBM Research

Try to upload files from time to time --PantheraLeo1359531 😺 (talk) 18:16, 22 August 2024 (UTC)[reply]

@DigitalIceAge 437 files are Category:Videos by IBM Research uploaded to date --PantheraLeo1359531 😺 (talk) 16:23, 27 March 2025 (UTC)[reply]

Newspapers by Feureau in Internet Archive

Source to upload from

https://archive.org/details/%40feureau

License

Public domain

Description

This user has uploaded more than a thousand old newspapers from Indonesia, as well as Dutch magazine Tong Tong.

Please help importing the newspaper here, they would make a great addition to Category:Newspapers_of_Indonesia

Bennylin (yes?) 18:29, 18 February 2023 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Babad Diponegoro from Internet Archive

Source to upload from

Volume 1 https://archive.org/details/eap-1268-babad-diponegoro-v-1-0001/Babad%20Diponegoro%20Jilid%201/EAP1268_Babad_Diponegoro_V1_0006.jpg 15+ GB of jpgs
Volume 2 https://archive.org/details/eap-1268-babad-diponegoro-v-2-0001 ? GB of jpgs

License

Public Domain

Description

The IA didn't provide DJVU nor PDF format, only zipped JPGS (1429 files and 1303 files) Bennylin (yes?) 06:57, 15 February 2023 (UTC)[reply]

I found another file, same, no pdf/djvu

https://archive.org/details/supratman

Opinions

There are PDF files, but over 500 MB each. Yann (talk) 20:26, 17 April 2025 (UTC)[reply]
Yes. I already uploaded them here. In 2023 there were no PDF. We can close this request. Thank you for looking this up. Bennylin (yes?) 07:57, 18 April 2025 (UTC)[reply]

Assigned to	Progress	Bot name	Category

CC0 ant micro-CTs

X-ray microtomograms of ants

Source to upload from

Dryad Subject Area: cybertype - Blacklight Search Results

License

Creative Commons CC0 License (Q6938433) (rationale)

Description

what makes it valuable to Wikimedia Commons?
- expands the selection of biology-focussed STLs in the repository.
Do the media URLs follow a pattern?
- https://datadryad.org/stash/downloads/file_stream/[five-digit number]
Does the site have an API?
- According to https://datadryad.org/stash/our_platform#architecture-and-implementation ...maybe?
What else could ease uploading?
- If the files are categorised per 'stash' (dataset/DOI) with informational text or template, then a subset of files can be uploaded with the others easy to find and also upload later, similarly to the PLoS import
- See also https://antwiki.org/wiki/index.php?title=Special:CargoQuery&limit=500&offset=100&tables=Economolab3D&fields=_pageName%3DPage%2CName%3DName%2CGenus%3DGenus%2CCaste%3DCaste%2CView%3DView%2CLink%3DLink%2CSpecimenIdentifier%3DSpecimenIdentifier%2CInstitution%3DInstitution%2CNotes%3DNotes&max+display+chars=300, derived from the source above
Did you contact the site owner?
- no, not needed for CC0 license
Is there a template that could be used on the file description pages, or should one be created?
- {{Sketchfab}}

Arlo James Barnes 23:01, 10 June 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Denkmalatlas Niedersachsen

Images of cultural monuments in Lower-Saxony, Germany.

Source to upload from

https://denkmalatlas.niedersachsen.de/viewer/

License

CC BY-SA 4.0

Description

Images of cultural monuments in Lower-Saxony, Germany, from the "Denkmalatlas Niedersachsen" project of the Lower Saxony State Office for Heritage Conservation. The project offers exterior shots of the monuments. Photos shot from public space are permitted in accordance with the freedom of panorama in Germany. For published photos shot on private property, the State Office has the consent of the property owner. In the "Denkmalatlas", all photos are published with the license CC BY-SA 4.0.

Timk70 (talk) 16:00, 10 June 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Editora Fi

Source to upload from

https://www.editorafi.org/catalogo

License

CC BY-SA 4.0

Description

Open Access/CC books perfect for Wikisource. I believe that it is everything on Google Drive. It needs an specific template. Erick Soares3 (talk) 12:36, 9 March 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

SciELO Books

Source to upload from

http://books.scielo.org/; https://archive.org/details/scielobooks; https://archive.org/details/@scielo_books
License

Several types of Creative Commons (including non-commercial) and Public Domain.

Description

SciELO Books, part of Scielo Brazil (also an amazing source for Wikisource), have a partnership with several academic publishers to release or re-release their works on Open Access, be CC or Public Domain.

Since it is clearly legal, it should be an amazing resource for Wikisource and the Wikimedia in general.

The bot should be able to read the archive and select the ones with Wiki Commons friendly licenses. Internet Archive some works released as CC BY-4.0 are registered as non-commercial (example). A similar thing also happens on the main website: 1 and 2. Would be nice if the bot could compare the main website and the Internet Archive collection for missing files and check at least once a month for new works released into Wiki friendly licenses.

It is necessary an official template. Thanks, Erick Soares3 (talk) 20:38, 6 March 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Biblioteca Digital Hispánica

Source to upload from: Photography collection from the Biblioteca Digital Hispánica: search query
- Do the media URLs follow a pattern? metadata permalink, viewer permalink, JPEG deep link
- Does the site have an API? No
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) The HTML is quite well-formed and follows an homogeneous structure, although metadata tabulation is a bit weird.
- Did you contact the site owner? No

Describe the works to be uploaded in detail (audio files, images by …): This request is for a subset of this Digital Library covering photographies and engravings. Note that the JPEG deep link provided above is valid only to fetch the first page of the document. For this collection, most (all?) works are a single page.

Which license tag(s) should be applied? It depends on the work. I think it should generally be PD-old-assumed, and in some cases PD-old-70 and PD-old-100.

Is there a template that could be used on the file description pages? Do you think a special template should be created? I created a manual sample here: File:Retrato de Mariano Ballestero (1869).jpg

I already have a scraper and (work in progress) page generator for this collection. So I can help to provide everything in the required format. Anyway, I think the bulk of pending work is probably identifying author and the right license tag for each work.

MarioGom (talk) 21:15, 12 October 2020 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Perry–Castañeda Library Map Collection

Source to upload from: http://legacy.lib.utexas.edu/maps/ams/
- Do the media URLs follow a pattern?
  The urls themselves, so far as I can work out, don't, but in the same way as in Adobe Acrobat Pro you can set it to go down a list of web links to generate a single pdf, a bot may be able to too
- Does the site have an API?
  Bit technical, but I dont' think so
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)
  I don't know
- Did you contact the site owner?
  No
Describe the works to be uploaded in detail (audio files, images by …): vast series of maps generated by the US Army Map Service (i.e., PD-USGov-Military) in the Perry–Castañeda Library Map Collection, The University of Texas at Austin
Which license tag(s) should be applied? PD-USGov-Military
Is there a template that could be used on the file description pages? Do you think a special template should be created? In terms of the file naming convention, this could follow that of the site, i.e, the top of each page has the series, a credit to the US AMS, and the date, then each map file has the name of the map, the sheet number (for the index pages, cross-references from adjoining maps etc), and the scale

NB there are already some files at Category:India maps by U.S. Army Map Service (plus various other individual uploads etc within Category:Maps by the United States Army Map Service), and it looks from below on this page and eg this commons image that "Slick-o-bot" may have been used in 2012 to upload some or all of these (I'm most keen on the various Japan-related maps (especially the 3x Honshu 1:50,000 series) but imagine every region would benefit).
This would be a mind-bogglingly great addition, thank you, Maculosae tegmine lyncis (talk) 14:08, 13 August 2020 (UTC)[reply]

PS, these are much more detailed than google maps - and the labelling is in English (with some Japanese too), Maculosae tegmine lyncis (talk) 19:27, 21 August 2020 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Claremont Colleges Digital Library

Source to upload from: https://ccdl.libraries.claremont.edu/digital/collection/bce
- Do the media URLs follow a pattern? Yes
- Does the site have an API? Not sure
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Not sure
- Did you contact the site owner? No

Describe the works to be uploaded in detail (audio files, images by …):

All photos in the Boynton Collection of Early Claremont, all of which are dated prior to 1925. If it's not too much trouble, it would also be very nice to have all photos in the Claremont Colleges Photo Archive and City of Claremont History Collection dated prior to 1925.

Which license tag(s) should be applied? {{PD-US-expired}}

Is there a template that could be used on the file description pages? Do you think a special template should be created? Not sure

Sdkb (talk) 07:42, 8 August 2020 (UTC)[reply]

Opinions

Were these photos published prior to 1925, or merely taken prior to then? Publication needs to be pre-1925 for {{PD-US-expired}} to be allowed. Pi.1415926535 (talk) 08:25, 8 August 2020 (UTC)[reply]

@Pi.1415926535: The about page states The collection ... is believed to have come to Pomona College included with the papers of Charles Luther Boynton, a Pomona College alumnus and missionary to China. Boynton himself graduated from Pomona around 1900. I can't say for sure the year his papers came into possession of the college, though (which I assume would be the date of publication?). The library would probably tell us if we asked, though. Sdkb (talk) 05:47, 10 August 2020 (UTC)[reply]

Acquisition by the college would not be considered publication for the purposes of copyright. Only use in a publicly released printed material, or on a webpage, is considered publication. Pi.1415926535 (talk) 06:48, 10 August 2020 (UTC)[reply]

@Pi.1415926535: does being added to a library not count as publication? The collection has presumably been housed in the special collections department and publicly available to anyone who requested access since it was obtained. Sdkb (talk) 20:55, 10 August 2020 (UTC)[reply]

A collection merely being in a library does not constitute publication, by my reading. Under copyright law, publication is the distribution of copies or phonorecords of a work to the public by sale or other transfer of ownership or by rental, lease, or lending. Offering to distribute copies or phonorecords to a group of people for purposes of further distribution, public performance, or public display also constitutes publication. (From here.) Is the death date of Boynton known? If it was before 1950, then {{PD-old-70}} applies. Pi.1415926535 (talk) 23:14, 10 August 2020 (UTC)[reply]

@Pi.1415926535: According to here, Boynton died in 1961, so not quite. The above would seem to me to indicate being in a library counts, though, because of lending, which is what a library does. Sdkb (talk) 19:11, 11 August 2020 (UTC)[reply]

A collection in the library would be the originals (not copies) and is likely for use only in the library (not lending). I understand that you wish to have this collection available on Commons, but from the available evidence I do not believe the images are public domain. Pi.1415926535 (talk) 20:58, 11 August 2020 (UTC)[reply]

Assigned to	Progress	Bot name	Category

Balinese Lontar from Internet Archive

Source to upload from: http://archive.org/details/Bali
- Do the media URLs follow a pattern? yes
- Does the site have an API? yes
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) N/A
- Did you contact the site owner? yes

Describe the works to be uploaded in detail (audio files, images by …):
- Balinese Lontar (palm-leaf manuscripts) from the Internet Archive's Bali collection
- Each manuscript is a PDF containing photographs of the originals
- This batch upload is in connection with an active project grant.

Which license tag(s) should be applied?

{{PD-scan}}, following the behavior of the ia-upload tool.

Is there a template that could be used on the file description pages? Do you think a special template should be created?

Yes. I will follow the ia-upload template closely when doing the batch upload. I will use a short python script that aggregates info from the Internet Archive API and sends each upload request via pywikibot. If necessary I will create a bot account for this purpose. There are approximately 2700 items to upload.

Lautgesetz (talk) 01:03, 4 July 2020 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Japanese Homes and their surroundings

Source to upload from: List of files, List of illustrations with names assigned to each number. It would be really nice if the figures contained their original names in teh uploaded filenames.
- Do the media URLs follow a pattern?:
  - Yes. https://www.gutenberg.org/files/52868/52868-h/images/fig001.jpg - https://www.gutenberg.org/files/52868/52868-h/images/fig307.jpg
  - Note that combined figures fig114_117.jpg and fig188_192.jpg do not follow this pattern; titldeco.jpg is a lower-res red version of one of the other illustrations, used as a frontispiece, and can be omitted.
- Does the site have an API?:
  - I assume that Gutenberg has an API. If someone can point me at instructions on how to use it with Commons, I might be able to do this myself; I assume this is a beaten path...
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?):
- Did you contact the site owner?:
  - No, for Gutenberg this seems redundant.
  - I uploaded some manually already, with permission, from another site (names files with pattern https://www.kellscraft.com/JapaneseHomes/JapanHomes001.jpg, to JapanHomes301.jpg, 129-130 are duplicates, figure numbers do not align with file names, so combined illustrations cause no disruption to sequential numbering). The Gutenberg images are in better in many, but not all, cases (higher-res, better scan).
  - The same book is also at [7], but the images seem to be worse.

Describe the works to be uploaded in detail (audio files, images by …):
- All of the illustrations from a PD book, jpgs, architectural line drawings by Creator:Edward S. Morse.

Which license tag(s) should be applied?:
- {{PD-old-70-1923}}
- note: five years from PD-100

Is there a template that could be used on the file description pages? Do you think a special template should be created?
- {{Creator:Edward S. Morse}} If uploaded to Category:Japanese Homes and Their Surroundings (1885 book), I will manually categorize them and add descriptions. A special template seems redundant.

Thank you! HLHJ (talk) 04:17, 4 February 2020 (UTC)[reply]

I uploaded 167-307 from the website. For each image I tried to fetch the description by getting the text of all the divs which mention that figure.

999

REAL 💬 ⬆ 15:41, 12 April 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Baseball Hall of Fame

The National Baseball Hall of Fame and Museum is releasing a larger portion of their collection online lately, many are in the public domain. See for example this collection on Honus Wagner https://collection.baseballhall.org/PASTIME/wagner-honus-0?page=7&fbclid=IwAR2cYYBeEMTsN_PGJFCXB5qqYoTtcBBCPkGwFGh3NqUbNtYYww7OWHizdvA

Is there a practical way to batch extract and upload files that are tagged with "http://rightsstatements.org/vocab/NoC-US/1.0/" under the "Copyright note" section? They basically confirm which files are in the public domain. Or they will sometimes post in that same section "The National Baseball Hall of Fame and Museum is not aware of any U.S. copyright or any other restrictions in the documents."

Oaktree b (talk) 02:16, 23 November 2019 (UTC)[reply]

Opinions

Unfortunately I think this collection is now being locked down. The link above 404s and there is no obvious way to get to a collection from the root website (best link is https://baseballhall.org/the-museum/collections/photo-archives). – BMacZero (🗩) 07:05, 12 January 2025 (UTC)[reply]

Assigned to	Progress	Bot name	Category

NPGallery

Source to upload from: https://npgallery.nps.gov/
- Do the media URLs follow a pattern? https://npgallery.nps.gov/AssetDetail/<GUID>
- Does the site have an API? Unknown
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Unknown
- Did you contact the site owner? No

Describe the works to be uploaded in detail (audio files, images by …):

"NPGallery supports a wide array of digital asset file types (images, MS office formats, adobe pdfs, audio files, videos)." We would, I think, be primarily interested in their photographs of national parks.

Which license tag(s) should be applied?

{{PD-USgov}} may apply to many images, but they need to be checked individually. This could probably be automated to some degree.

Is there a template that could be used on the file description pages? Do you think a special template should be created?

Standard templates such as {{Photograph}} should be acceptable.

This was spotted by Animalparty on COM:VP. BMacZero (talk) 00:12, 22 January 2019 (UTC)[reply]

Opinions

Comments by Animalparty.

{{PD-USGov}} would be the most inclusive template, but is rather vague. More specific templates include {{PD-USGov-NPS}} and {{PD-USGov-Interior}}. Any Photographer field that says "NPS Staff" or "NPS Photo" (e.g. [8]) should automatically get PD-USGov-NPS.
I think {{Photograph}} or {{Information}} are fine, ideally with detailed semi custom fields for keywords, collection, location, etc., as seen in the Library of Congress images uploaded by User:Fæ (example).
The more pre- or auto-categorization, or at least clearly noting collection, yeear/decade, geographic unit, etc., the better, else we dump thousands of unsorted of images into already cluttered categories like Yosemite National Park.
There may be overlap with some material on Archives.gov , individual National Park Flickr feeds/websites, and such material already uploaded. But I think the value of the images uploaded at their largest file size and with curated metadata outweigh the inconvenience of some duplication.
Many files have geographical coordinates, but I suspect that many are generic coordinates of the center of the National Park or Monument, rather than being unique to the photograph.
Thanks for initiating this, sorry if these comments are basic/obvious to experienced mass uploaders. --Animalparty (talk) 01:29, 22 January 2019 (UTC)[reply]

On some more inspection, certain images may be a bit problematic in terms of copyright, namely works of art (e.g. paintings and sculptures) not explicitly credited to NPS employees, but that are nonetheless labeled "Public domain:Full Granting Rights". Some of these appear to be created by Artist-in-Residence programs (e.g. this gallery and this one), and from browsing elsewhere it appears that different parks may have different rules regarding copyrights. Rocky Mountain National Park states "Artists are also required to provide the copyright for this artwork to the National Park Service. The National Park Service will not allow the commercial use of any donated artwork once it is selected and accessioned into the Park's permanent museum collection", which is a restriction against public domain. Perhaps no art from Rocky Mountain was transferred to NPGallery? These 2 images from the U.S.S. Arizona memorial are labeled PD on NPGallery, yet on a different NPS page their status is ambiguous, with the included usage disclaimer "Multimedia credited with a copyright symbol (indicating that the creator may maintain rights to the work) or credited to any entity other than NPS must not be presumed to be public domain; contact the host park or program to ascertain who owns the material" (emphasis added).

Side note: I think every photograph I've viewed on NPGallery has the Copyright disclaimer "Permission must be secured from the individual copyright owners to reproduce any copyrighted materials contained within this website. Digital assets without any copyright restrictions are public domain.", but every file is also labeled Public domain in the Constraints Information.

Another snag I've noticed, just from browsing the term "Artist", are that some images are scans/photographs from newspapers that were most likely not originally created by Federal employees (although the derivative scans/photos are): for instance Louis Grell illustration album, with cartoons by Louis Grell published in World War I.[9] These are still PD via pre-1924 publication (and possibly by {{PD-USGov-Military}}), but it hinders accurate bot-designation of PD template.

And public domain rationale is ambiguous on this vido, with Copyright" "Photo courtesy of Betty Maya Foott, Colorado Plateau Dark Sky Cooperative" (so, probably not a federal employee), yet is nonetheless labeled "Public domain:Full Granting Rights". I may have just found a relative handful of exceptions. But there are also probably a good deal of historical photographs that are PD-1923 or PD-no-notice yet not US Government works. Perhaps a generic umbrella template similar to {{Flickr-no known copyright restrictions}} could be used to encapsulate different possibilities, like {{PD-NPGallery}}.

I think it would be a good idea to contact someone at NPGallery to double check that all media labeled public domain is in fact public domain, for some reason, especially when rationale is ambiguous or lacking. We also might want to consider not transfering the somewhat intimidating, potentially misleading Copyright message "Permission must be secured from the individual copyright owners to reproduce any copyrighted materials contained within this website. Digital assets without any copyright restrictions are public domain." This may be a liability disclaimer on NPGallery's end, but ideally, everything we transfer to Commons would be in the public domain, and so no permission need be secured. --Animalparty (talk) 11:45, 25 January 2019 (UTC)[reply]

Working on adapting my bot to handle this. I'll contact them, and also start with only things that are obviously PD. BMacZero (talk) 17:50, 9 February 2019 (UTC)[reply]

I e-mailed NPGallery a while back about the public domain statuses of images and neglected to share here. Unfortunately got a not-too-helpful response essentially saying that the licenses and attributions are not "consistent" and "there is not a good way to assure an asset id is truly in the public domain, or not". We'll have to figure out what types of signals we can rely on to decide whether {{PD-USGov-NPS}} or other templates apply. Of course, publication pre-1924 will be a good one to start. BMacZero (talk) 04:30, 11 April 2019 (UTC)[reply]

I'm currently harvesting a list of all the images. It's going a bit slow but it should only a take a few days. After that I'll start downloading the metadata, which may take several days. BMacZero (talk) 04:45, 12 April 2019 (UTC)[reply]

Ah, a shame about the inconsistent licensing criteria. I guess pre-1924 and files credited to "NPS staff" or similar can be prioritized for now. --Animalparty (talk) 19:13, 12 April 2019 (UTC)[reply]

Started downloading the item metadata. You can check on the progress on this fun page I made. BMacZero (talk) 15:49, 13 April 2019 (UTC)[reply]

BRFA filed (Commons:Bots/Requests/BMacZeroBot 6). BMacZero (talk) 05:35, 10 May 2019 (UTC)[reply]

Started uploading last night, will probably be ongoing for quite a while. See Category:Images from NPGallery to check to help with validation and categorization! – BMacZero (🗩) 16:35, 29 June 2019 (UTC)[reply]

Assigned to	Progress	Bot name	Category
User:BMacZero	In progress	User:BMacZeroBot	Category:Images from NPGallery to check

See Also

APPLAUSE

Source to upload from: https://www.plate-archive.org/applause/
- Do the media URLs follow a pattern? https://www.plate-archive.org/objects/dr.3/ + plates or logbooks or notes or envelopes + /101_xxxx/ (x is a variable number)

Does the site have an API? Yes: 101_xxxx (x is a variable number)
What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) https://www.plate-archive.org/query/
Did you contact the site owner? No

Describe the works to be uploaded in detail (audio files, images by …): Historical astronomical plates, logbooks, envelopes or notes https://www.plate-archive.org/applause/info/gallery/ (we don't need to upload all, but I think the plates would be insteresting.
Which license tag(s) should be applied?

Plates: {{CC-0}} for example https://www.plate-archive.org/objects/dr.3/plates/101_3309/
Others: {{CC-BY-4.0}} for example https://www.plate-archive.org/objects/dr.3/logbooks/101_53/

The database is licensed under CC-0 (https://www.plate-archive.org/applause/project/disclaimer/)

Is there a template that could be used on the file description pages? Do you think a special template should be created? Yes, I think a template should be created.

Habitator terrae 🌍 16:37, 27 October 2018 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

PauloGuedes

Source to upload from: Institution:Arquivo Municipal de Lisboa
- Do the media URLs follow a pattern? yes:

This url generates 94 results pages, each linking to 10 individual image pages. Each image page url is

http://arquivomunicipal2.cm-lisboa.pt/X-arqWeb/ContentPage.aspx?ID=code&Pos=1&Tipo=PCD

while the image in it is at

http://arquivomunicipal2.cm-lisboa.pt/X-arqWeb/ContentDisplay.aspx?ID=code&Pos=1&Tipo=PCD&Thb=0

with code being a 20-digit lower-case hex number — which has no bearing with the official identification references (cota — see below).

Does the site have an API? dunno

- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) consistent, machine-generated HTML (parsable, even if not necessarily valid)
- Did you contact the site owner? No
Describe the works to be uploaded in detail (audio files, images by …): Smallish batch (711, according to inventory, or 933, according with the database search report) of scanned b/w photos in various hardcopy formats.

Which license tag(s) should be applied? {{PD-old}} Creator:Paulo Guedes

Is there a template that could be used on the file description pages? Do you think a special template should be created? {{AMLx}}; it needs to be fed at least {{{cota}}} (given also as código de referência), a slashed crumbthread-like alphanumeric string of variable length; other values to be (trivially) extracted from each image page are:

Título
Assunto
Data(s)
Dimensão e suporte
Nota(s)
Cotas antigas or Cotas or Cota(s)

The filenames can be constructed from Título (possibly trimmed) and the two last crumbs of {{{cota}}}, in parenthesis, devoided of the slash (which is one of the Cotas)

-- Tuválkin ✉ ✇ 16:54, 30 June 2018 (UTC)[reply]

Never mind. The bunch of imcompetents at CMLarq changed their software and “of course” old urls wont work. As they are also copyfraud goons, the new search functionality throws us back to the 1970s and it’s even less usable. Better visit their facilities in Lisbon (now rehoused in a modern neighbourhood becuae their historic HQ had to be converted into a tourist trap) and fiddle around with a microfilm viewer or some such nonsense. -- Tuválkin ✉ ✇ 22:03, 5 August 2024 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

VOA News files

Source to upload from: https://web.archive.org/web/*/https://www.voanews.com/mp3/voa/english/nnow/NNOW_HEADLINES.mp3
- Do the media URLs follow a pattern? They all have the same name. The date when archived is given in 14 digits, with the first eight digits being the year, month, and day respectively, with the remaining digits being the time of day archived, in UTC.
- Does the site have an API? Don't know.
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Don't know.
- Did you contact the site owner? No need to, since U.S. government works so public domain.

Describe the works to be uploaded in detail (audio files, images by …): VOA world news headline newscast audio files for (almost) every day spanning from 5 May 2009 to 6 July 2019.

Which license tag(s) should be applied? Template:PD-USGov-VOA

Is there a template that could be used on the file description pages? Do you think a special template should be created? Just use the standard one. Upload as "VOA News Headlines (MONTH DAY, YEAR)". If possible, upload them in FLAC, WAV, and OGG.

– Illegitimate Barrister (talk • contribs), 13:07, 26 May 2019 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

HiRISE

Source to upload from: https://www.uahirise.org/catalog/
- Do the media URLs follow a pattern? Yes! (Based on the catalog ID)
- Does the site have an API? No!
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)
  Full index tree of the images on the site is accessible via:
  https://hirise-pds.lpl.arizona.edu/PDS/RDR/ESP/
  
  With extra files in:
  https://hirise-pds.lpl.arizona.edu/PDS/EXTRAS/RDR/ESP/
  
  Each image file *.JP2 (sample [Big file]) accompanies the additional information in a separate label file *.LBL in PDS format (sample)
- Did you contact the site owner? Nope!

Describe the works to be uploaded in detail (audio files, images by …):
Images by HiRISE (High Resolution Imaging Science Experiment)

Which license tag(s) should be applied?

As explained in each image's description page for example: "All of the images produced by HiRISE and accessible on this site are within the public domain: there are no restrictions on their usage by anyone in the public, including news or science organizations. We do ask for a credit line where possible: NASA/JPL/University of Arizona"
PD-USGov-NASA or a variation of it to include JPL and University of Arizona must be used.

Is there a template that could be used on the file description pages? Do you think a special template should be created?

There is no template yet. It must be created to include all the relevant data e.g. Acquisition date, Latitude , Longitude , etc. from the label files.

Note: Due to JPEG2000 not being currently supported on Wikimedia Commons, a conversion to PNG is also needed. File sizes may be large!

Meisam (talk) 21:58, 20 June 2018 (UTC)[reply]

Opinions

Support Seems like an interesting project --Kristbaum (talk) 16:00, 8 May 2019 (UTC)[reply]
Info Template:PD-NASA-HiRISE has been created for these images! -- Meisam (talk) 17:31, 11 May 2019 (UTC)[reply]
@Meisam: - I am interested in pursuing this. I think it would be a logical extension of my work with uploading from ESRS. Do you have any suggestion as to how we most efficiently store the PDS data with each image? Askeuhd (talk) 08:28, 6 June 2022 (UTC)[reply]
@Askeuhd: I don’t have any good solutions. I suppose we can store them as tEXt chunks in the PNG image and also add them in a table (using wiki templates) to the image description page. -- Meisam (talk) 11:30, 6 June 2022 (UTC)[reply]
@Meisam: - I would personally much prefer the latter option, I fear that the former option may not be very user friendly. We will also have to parse as much of the data as possible to SDC. I will try to think of a suitable paradigm. Askeuhd (talk) 11:40, 6 June 2022 (UTC)[reply]

Assigned to	Progress	Bot name	Category

PDS data import proposal

Proposal for the import of PDS data for each image, to ensure as much as possible is added to SDC and necessary data for the user is displayed prominently in the wikitext.

I propose that the entire LBL file is imported as a collapsible text field in the template for each file, preserving all formatting and indentation, so that researches or other users familiar with the PDS format may be able to utilize plain text search for the values we will not be able to add to SDC, similar to this:

Raw Planetary Data System data
`PDS Content of LBL file`

In addition to this, I have broken down the example file, to try and maximize possible SDC data migration, as well as adding some of the data to a custom wikidata template for this particular import. I am highly interested in any suggestions. I take the libery of pinging @Multichill: as you have previously been very helpful in a similar endeavour with the ISS photos. I hope you would be interested in adding your valuable input here as well.

I will make a couple of example files in a few days or so to test the SDC structure and a potential template, before starting any peliminary coding, so the concepts can be tested out.

Reference to be set as stated in (P248) --> "Planetary Data System" for all SDC values imported from PDS.

All PDS identifiers can be looked up here for clarification. The LBL example file also contains some in-line comments.

Paradigm for importing PDS data - based on example ESP_053850_2170_RED.LBL
PDS Identifier	PDS Value in example file	Commons/SDC identifier	Commons/SDC value
PDS_VERSION_ID	PDS3	N/A	N/A
NOT_APPLICABLE_CONSTANT	-9998	N/A	N/A
DATA_SET_ID	"MRO-M-HIRISE-3-RDR-V1.1"	N/A	N/A
DATA_SET_NAME	"MRO MARS HIGH RESOLUTION IMAGING SCIENCE EXPERIMENT RDR V1.1"	part of the series (P179)	Appropriate wikidata-entity for this property (to be created)
PRODUCER_INSTITUTION_NAME	"UNIVERSITY OF ARIZONA"	affiliation (P1416) as a qualifier to creator (P170)	University of Arizona (Q503419)
PRODUCER_ID	"UA"	N/A	N/A
PRODUCER_FULL_NAME	"ALFRED MCEWEN"	creator (P170)	"some value" --> "Alfred McEwen"
OBSERVATION_ID	"ESP_053850_2170"	catalog code (P528) and {{NASA-image}}	"some value" --> "ESP_053850_2170", qualified with catalog (P972) and an appropriate wikidata-entity for this property (to be created).
PRODUCT_ID	"ESP_053850_2170_RED"	N/A	N/A
PRODUCT_VERSION_ID	"1.0"	N/A	N/A
INSTRUMENT_HOST_NAME	"MARS RECONNAISSANCE ORBITER"	location of creation (P1071) & location of the point of view (P7108)	Mars Reconnaissance Orbiter (Q183160)
INSTRUMENT_HOST_ID	"MRO"	N/A	N/A
INSTRUMENT_NAME	"HIGH RESOLUTION IMAGING SCIENCE EXPERIMENT"	captured with (P4082)	HiRISE (Q1036092)
INSTRUMENT_ID	"HIRISE"	N/A	N/A
TARGET_NAME	"MARS"	depicts (P180)	Mars (Q111)
MISSION_PHASE_NAME	"EXTENDED SCIENCE PHASE"	significant event (P793)	Appropriate wikidata-entity for this property (to be created)
ORBIT_NUMBER	53850	orbits completed (P1418)	value: 53850 possibly qualified with type of orbit (P522) --> areocentric orbit (Q3884965)
SOURCE_PRODUCT_ID	(ESP_053850_2170_RED0_0, ESP_053850_2170_RED0_1, ESP_053850_2170_RED1_0, ESP_053850_2170_RED1_1, ESP_053850_2170_RED2_0, ESP_053850_2170_RED2_1, ESP_053850_2170_RED3_0, ESP_053850_2170_RED3_1, ESP_053850_2170_RED4_0, ESP_053850_2170_RED4_1, ESP_053850_2170_RED5_0, ESP_053850_2170_RED5_1, ESP_053850_2170_RED6_0, ESP_053850_2170_RED6_1, ESP_053850_2170_RED7_0, ESP_053850_2170_RED7_1, ESP_053850_2170_RED8_0, ESP_053850_2170_RED8_1)	N/A	N/A
RATIONALE_DESC	"Monitoring new impact site"	to be added to {{En}} in main template - We might also go over all LBL files to search for obvious depicts (P180) statements	"PDS description: Monitoring new impact site"
SOFTWARE_NAME	"PDS_to_JP2 v3.19 (1.53 2012/01/24 03:07:27)"	I was unable to find an appropriate wikidata property here, but I feel like there should be one	?
OBJECT = IMAGE_MAP_PROJECTION
DATA_SET_MAP_PROJECTION	"DSMAP.CAT"	N/A	N/A
MAP_PROJECTION_TYPE	"EQUIRECTANGULAR"	I was unable to find the appropriate wikidata property for "projection", I might be looking in the wrong place. spatial reference system (P3037) was the closest I got	equidistant cylindrical projection (Q1326965)
PROJECTION_LATITUDE_TYPE	PLANETOCENTRIC	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are planetocentric)	N/A
A_AXIS_RADIUS	3389.5743490888 <KM>	N/A (simply the mean readius of Mars)	N/A
B_AXIS_RADIUS	3389.5743490888 <KM>	N/A (simply the mean readius of Mars)	N/A
C_AXIS_RADIUS	3389.5743490888 <KM>	N/A (simply the mean readius of Mars)	N/A
COORDINATE_SYSTEM_NAME	PLANETOCENTRIC	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are planetocentric)	N/A
POSITIVE_LONGITUDE_DIRECTION	EAST	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are east-positive)	N/A
KEYWORD_LATITUDE_TYPE	PLANETOCENTRIC	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are planetocentric)	N/A
POSITIVE_LONGITUDE_DIRECTION	EAST	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are east-positive)	N/A
KEYWORD_LATITUDE_TYPE	PLANETOCENTRIC	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are planetocentric)	N/A
CENTER_LATITUDE	35.000 <DEG>	See below	See below
CENTER_LONGITUDE	180.000 <DEG>	coordinates of depicted place (P9149) - not completely sure though, as it the example specifically comments that the location is the center of the projection not necessarily the center of the image. So it may not be so helpful to import this value as the coordinates of depicted place (P9149) - see bounding values below	`{ "latitude": 35, "longitude": 180, "precision": 0.001, "globe": "http://www.wikidata.org/entity/Q106948918" }`
LINE_FIRST_PIXEL	1	N/A	N/A
LINE_LAST_PIXEL	32134	N/A	N/A
SAMPLE_FIRST_PIXEL	1	N/A	N/A
SAMPLE_LAST_PIXEL	25483	N/A	N/A
MAP_PROJECTION_ROTATION	0.0 <DEG>	N/A	N/A
MAP_RESOLUTION	236636.93053097 <PIX/DEG>	angular resolution (P3439)	converted to milliarcseconds/pixel (1/236636.93053097*3600000) value: 15.21317907531, unit: milliarcsecond (Q21500224)
MAP_SCALE	0.25 <METERS/PIXEL>	I was unable to find an appropriate wikidata property here, something like "ground sample distance" or similar - I think it should be included as custom field in the wikitext template for each image, as it is a very commonly needed figure	N/A
MAXIMUM_LATITUDE	36.973920949851 <DEG>	coordinates of northernmost point (P1332)	`{ "latitude": 36.973920949851, "longitude": 148.23651113052, "precision": 0.000000000001, "globe": "http://www.wikidata.org/entity/Q106948918" }` <-- longitude set to westermost longitude clarified by syntax clarification (P2916) qualifier.
MINIMUM_LATITUDE	36.838131126084 <DEG>	coordinates of southernmost point (P1333)	`{ "latitude": 36.838131126084, "longitude": 148.36797304112, "precision": 0.000000000001, "globe": "http://www.wikidata.org/entity/Q106948918" }` <-- longitude set to easternmost longitude clarified by syntax clarification (P2916) qualifier.
LINE_PROJECTION_OFFSET	8749396.5 <PIXEL>	N/A	N/A
SAMPLE_PROJECTION_OFFSET	6157087.5 <PIXEL>	N/A	N/A
EASTERNMOST_LONGITUDE	148.36797304112 <DEG>	coordinates of easternmost point (P1334)	`{ "latitude": 36.973920949851, "longitude": 148.36797304112, "precision": 0.000000000001, "globe": "http://www.wikidata.org/entity/Q106948918" }` <-- latitude set to maximum latitude clarified by syntax clarification (P2916) qualifier.
WESTERNMOST_LONGITUDE	148.23651113052 <DEG>	coordinates of westernmost point (P1335)	`{ "latitude": 36.838131126084, "longitude": 148.23651113052, "precision": 0.000000000001, "globe": "http://www.wikidata.org/entity/Q106948918" }` <-- latitude set to minimum latitude clarified by syntax clarification (P2916) qualifier.
GROUP = TIME_PARAMETERS
MRO:OBSERVATION_START_TIME	2018-01-21T12:51:50.434	N/A	N/A
START_TIME	2018-01-21T12:51:50.582	N/A	N/A
SPACECRAFT_CLOCK_START_COUNT	"1201006358:10651"	N/A	N/A
STOP_TIME	2018-01-21T12:51:53.012	inception (P571) and date field in template	date used as value for wikidata property, full time string parsed to data field for date field in wikitext template
SPACECRAFT_CLOCK_STOP_COUNT	"1201006360:38785"	N/A	N/A
PRODUCT_CREATION_TIME	2018-01-25T05:01:36	publication date (P577) but I am not completely sure here	date used as value for wikidata property.
GROUP = INSTRUMENT_SETTING_PARAMETERS
MRO:CCD_FLAG	(ON, ON, ON, ON, ON, ON, ON, ON, ON, OFF, ON, ON, ON, ON)	N/A	N/A
MRO:BINNING	(1, 1, 1, 1, 1, 1, 1, 1, 1, -9998, -9998, -9998, -9998, -9998)	N/A	N/A
MRO:TDI	(128, 128, 128, 128, 128, 128, 128, 128, 128, -9998, -9998, -9998, -9998, -9998)	N/A	N/A
MRO:SPECIAL_PROCESSING_FLAG	(NOMINAL, NOMINAL, NOMINAL, NOMINAL, NOMINAL, NOMINAL, NOMINAL, NOMINAL, NOMINAL, "NULL", "NULL", "NULL", "NULL", "NULL")	N/A	N/A
GROUP = VIEWING_PARAMETERS
INCIDENCE_ANGLE	42.714413 <DEG>	N/A	N/A
EMISSION_ANGLE	0.434473 <DEG>	tilt (P8208)	value: 0.434473 --> unit degree (Q28390)
PHASE_ANGLE	42.502572 <DEG>	N/A	N/A
LOCAL_TIME	15.10520 <LOCALDAY/24>	N/A	N/A
SOLAR_LONGITUDE	118.215906 <DEG>	N/A	N/A
SUB_SOLAR_AZIMUTH	173.163664 <DEG>	N/A	N/A
NORTH_AZIMUTH	270.000000 <DEG>	N/A	N/A
OBJECT = COMPRESSED_FILE
FILE_NAME	"ESP_053850_2170_RED.JP2"	N/A	N/A
RECORD_TYPE	UNDEFINED	N/A	N/A
ENCODING_TYPE	"JP2"	N/A	N/A
ENCODING_TYPE_VERSION_NAME	"ISO/IEC15444-1:2004"	N/A	N/A
INTERCHANGE_FORMAT	BINARY	N/A	N/A
UNCOMPRESSED_FILE_NAME	"ESP_053850_2170_RED.IMG"	N/A	N/A
REQUIRED_STORAGE_BYTES	1637741444 <BYTES>	N/A	N/A
DESCRIPTION	"JP2INFO.TXT"	N/A	N/A
INTERCHANGE_FORMAT	BINARY	N/A	N/A
OBJECT = UNCOMPRESSED_FILE
FILE_NAME	"ESP_053850_2170_RED.IMG"	N/A	N/A
RECORD_TYPE	FIXED_LENGTH	N/A	N/A
RECORD_BYTES	50966 <BYTES>	N/A	N/A
FILE_RECORDS	32134	N/A	N/A
IMAGE	"ESP_053850_2170_RED.IMG"	N/A	N/A
DESCRIPTION	"HiRISE projected and mosaicked product"	Could potentially be added to {{En}} in description	N/A
LINES	32134	N/A	N/A
LINE_SAMPLES	25483	N/A	N/A
BANDS	1	N/A	N/A
SAMPLE_TYPE	MSB_UNSIGNED_INTEGER	N/A	N/A
SAMPLE_BITS	16	N/A	N/A
SAMPLE_BIT_MASK	2#0000001111111111#	N/A	N/A
SCALING_FACTOR	1.41615214363203e-04	N/A	N/A
BANDS	1	N/A	N/A
OFFSET	0.060336154982679	N/A	N/A
BAND_STORAGE_TYPE	BAND_SEQUENTIAL	N/A	N/A
CORE_NULL	0	N/A	N/A
CORE_LOW_REPR_SATURATION	1	N/A	N/A
CORE_LOW_INSTR_SATURATION	2	N/A	N/A
CORE_HIGH_REPR_SATURATION	1023	N/A	N/A
CORE_HIGH_INSTR_SATURATION	1022	N/A	N/A
CENTER_FILTER_WAVELENGTH	700 <NM>	Should be added to wikitext template along with FILTER_NAME as the images will be uploaded as PNG 16 bit grayscale	N/A
MRO:MINIMUM_STRETCH	3	N/A	N/A
MRO:MAXIMUM_STRETCH	1021	N/A	N/A
FILTER_NAME	"RED"	N/A	N/A

Additionally I propose the following the properties

media type (P1163) --> "image/png"
source of file (P7482) --> file available on the internet (Q74228490) --> ((described at URL (P973) --> value: url to LBL file) and (work available at URL (P953) --> value: direct URL to JP2 file) and (operator (P137) --> University of Arizona (Q503419)) and perhaps (file format (P2701) --> JP2 (Q27979401)))
copyright status (P6216) --> public domain (Q19652) --> determination method or standard (P459) --> work of the federal government of the United States (Q60671452)
instance of (P31) --> photograph (Q125191)

@Meisam: --Askeuhd (talk) 16:08, 7 June 2022 (UTC)[reply]

freepd.com

Site contains production music tracks, in various genres, mp 3 format.

Source to upload from:

http://freepd.com/

- Do the media URLs follow a pattern?

None found. Tracks seem to be in sub-directories related to nominal genre, MP3 files are named for the track title apparently.

- Does the site have an API?

Unknown.

- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)

Unknown.

- Did you contact the site owner?

Site owner not contacted.

Describe the works to be uploaded in detail (audio files, images by …):

"Production music", in various genres., in MP3 format.

Which license tag(s) should be applied?

Site claims tracks are in the public domain:- http://freepd.com/faq.html ; However some of these tracks were previously under CC-BY on the site owners other site at incompetech.

Is there a template that could be used on the file description pages? Do you think a special template should be created?

{{Information}} with additional field as was done on the previous batch upload for incompetech.

ShakespeareFan00 (talk) 10:20, 18 December 2017 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Commons:Batch uploading/timbeek.com/

Source to upload from:

http://timbeek.com/ in particular music tracks listed in http://timbeek.com/royalty-free-music/isrc/

- Do the media URLs follow a pattern?

No general pattern, but there's a master list (not sure if it's complete) of track pages here - http://timbeek.com/royalty-free-music/isrc/, Donwload links in the UI seem to link to numbered subdirectories, but general pattern undetermined or not obvious.

- Does the site have an API?

Unknown.

- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)

Unknown

- Did you contact the site owner?

Site owner not contacted.

Describe the works to be uploaded in detail (audio files, images by …):

A Small set of 'production music' tracks, in assorted genres.

Which license tag(s) should be applied?

See: http://timbeek.com/royalty-free-music/license/ , assuming attribution requirments are met the music appears to be under CC-BY 4.0. (see also: http://timbeek.com/royalty-free-music/faq/ and http://timbeek.com/royalty-free-music/copyright/)

Is there a template that could be used on the file description pages? Do you think a special template should be created?

{{Information}} with additional fields as was previously implemented for the incomptech.com batch upload(this site seems to use a simmilar approach).

ShakespeareFan00 (talk) 19:05, 15 December 2017 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

USDA NRCS Plants Database

Source to upload from: http://plants.usda.gov/
- Do the media URLs follow a pattern? Yes.
- Does the site have an API? No.
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) valid XHTML
- Did you contact the site owner? No.

Describe the works to be uploaded in detail (audio files, images by …): Public domain: 10771 photos and 7064 line drawings, with species information for categorization. There are other copyrighted images as well, some of which may be freely licensed.

Which license tag(s) should be applied?

Is there a template that could be used on the file description pages? Do you think a special template should be created?

Opinions

@Guanaco: There is a lot of copyrighted material within these images, e.g. [10] [11]. (Just because this is a U.S. government web site this does not mean all the material is U.S. government material and by this means freely usable!) Actually I have not found too many images that really can be used (e.g. [12]). You should at least provide a procedure how to distinguish between copyrighted and free material. --Reinhard Kraasch (talk) 11:02, 9 July 2017 (UTC)[reply]

@Reinhard Kraasch: The gallery search function [13] has a filter by copyright status. [14]

I've found that the URLs linked by the thumbnails provide species information within <title>: https://plants.usda.gov/core/profile?symbol=HACA2&photoID=haca2_003_ahp.jpg#

and correspond to the URLs of the actual files: https://plants.usda.gov/gallery/pubs/haca2_003_php.jpg
as well as the URL with copyright status and recommended attribution info: https://plants.usda.gov/java/usageGuidelines?imageID=haca2_003_ahp.jpg

The search is navigable with &page=2, 3, 4, etc.

I'm actually interested in scripting this myself now, though it would be my first batch upload task. Guanaco (talk) 14:23, 9 July 2017 (UTC)[reply]

@Guanaco: Well, just go on... On the other hand it always is a good idea to have a second opinion with such a batch upload - especially for the non-technical aspects. --Reinhard Kraasch (talk) 20:52, 10 July 2017 (UTC)[reply]

Assigned to	Progress	Bot name	Category

US National Archives

I am hoping to begin a bulk upload of media from the US National Archives in the next few weeks. This will be a very different approach from the first upload, which was based on uploading files from an offline drive and scraping HTML for the metadata. This time around, NARA has an API for our online catalog, and so I am building a bot, using mwclient, to upload using the live metadata and files from the API. Some details:

Dataset

The dataset includes all PD materials at https://catalog.archives.gov (API: https://catalog.archives.gov/api/v1). I plan to begin with a series of ~100,000 WWI-era photos. Technically, there are over 15 million files (and counting) in this dataset.

File names

The script is currently configured to name files with the formula: For single-page items:

"File:[TITLE] - NARA - [NAID].ext"
Where "[TITLE]" is the catalog record's title field, and "[NAID]" is the National Archives Identifier. If this is over the character limit, "[TITLE]" is automatically truncated, with "(...)" appended.

For multi-page items (since the above formula would give all files belonging to one catalog record the same title):

"File:[TITLE] - NARA - [NAID] (page X).ext"

Metadata

We are developing a custom metadata mapping, since NARA does not adhere to a metadata standard. You can see the metadata template we use here: {{NARA-image-full}}. Some notes:

While all the records in this catalog come from NARA or partner institutions, there are many different facility locations, and some NARA facilities have their own institutions templates already (e.g. US presidential libraries). Therefore, I am creating institution templates to go along with all NARA locations, and the script will insert the correct institution template based on a mapping.

NARA's authority file is not yet mapped to Wikidata, however that is definitely something that would be useful in the future. For now, we will upload files with NARA's creator and author names and their NAIDs and links back to the catalog authority record. However, including the NAIDs in a Commons template field means that in the future, Wikidata could be used to make creator templates appear instead. Any help with this would be appreciated.

Licenses

Because NARA records are nearly all (>99%) derived from the records of US federal agencies, these uploads will use {{PD-USGov}} or its subtemplates. Most NARA records are in one of about 600 record groups based on their creating agency, so I am using a mapping of NARA record groups to Commons PD-USGov templates so that the bot can apply the more specific agency templates in most cases. Help filling out this mapping would be appreciated.

Nearly all holdings of the US National Archives are in the public domain as a work of the federal government (or, otherwise, due to age). This is marked in the "use restriction" field in the catalog, with a value of "Unrestricted" indicating public domain determination by the archivists. Therefore, the script will be configured to skip over any records in which the use restriction is anything other than "unrestricted" (even "possibly" ones, which could ultimately be PD, but need a human determination).

Categories

All uploads will be automatically categorized by the metadata template into Category:Media contributed by the National Archives and Records Administration and a category for the series they belong to (such as Category:US National Archives series: DOCUMERICA: The Environmental Protection Agency's Program to Photographically Document Subjects of Environmental Concern, compiled 1972 - 1977). Eventually, the script will be designed to create the series category if a file is uploaded for a series which does not yet have one.

When it comes to topical categories, past NARA uploads utilized the {{Uncategorized}} tag to encourage the community to add topical tags. However, since this creates work for the community, I am planning this time around to run uploads a small batch (hundreds to a few thousand) at a time, so I can upload them with one or more topical categories that apply to all records in the batch, rather than uncategorized.

Code

You can find the upload bot's code at https://github.com/usnationalarchives/wikimedia-upload. This project is being developed in public on NARA's official GitHub account. I would welcome collaboration (pull requests or otherwise) there. In addition, the Commons community is welcome to file issue reports on that repo.

Examples

The most recent test uploads can be viewed in Category:US National Archives series: American Unofficial Collection of World War I Photographs. I am still polishing the upload script, but these examples essentially represent what should be expected from the bot once it gets started.

Opinions

The bot account is technically already flagged from the last bulk upload a couple of years ago, however I would like to submit the current plan to community review before restarting uploads. If there are any opinions on the bot's design or the format of uploads or other issues, I am happy to hear them. We'd also like to know whether to limit what is uploaded in any way—as in, would Commons actually be interested in 15 million files, or might some of these, like the millions of census cards, not be of interest. Also, if anyone is interested in helping out with the coding or other tasks, please feel free to let me know. This is a big undertaking. Thanks! Dominic (talk) 17:25, 31 May 2017 (UTC)[reply]

Assigned to	Progress	Bot name	Category
User:Dominic	Coding	User:US National Archives bot	Category:Media contributed by the National Archives and Records Administration

ESA-Rosetta-NAVCAM

Source to upload from: http://imagearchives.esac.esa.int/index.php?/recent_pics
- Did you observe an URL pattern? See http://imagearchives.esac.esa.int/index.php?/page/rosetta_navcam
- Do you know whether the site has an API
- What else can ease uploading (is the site valid XHTML, WCM they use…)?
- Did you contact the site owner? No.

Describe the works to be uploaded in detail (audio files, images by …):

Images the comet 67P/CHURYUMOV-GERASIMENKO by the NAVCAM on the Rosetta spacecraft.

Which license tag(s) should be applied? ESA/Rosetta/NAVCAM – CC BY-SA IGO 3.0 (see {{ESA-ROSETTA-NAVCAM}} for the specific license template.)

Is there a template that could be used on the file description pages? Do you think a special template should be created?

Yann (talk) 14:32, 6 June 2015 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

USC Cinema

Source to upload from

https://archive.org/details/usc-sound-effect-archive

License

The files in this collection claim to be licensed CC BY 4.0. This is not true--all this archive is was someone collating the files and uploading them there. These files were all uploaded by Craig Smith on freesound.com under a CC0. The Gold and Red files are a valid {{CC0}} as Craig Smith works for USC. The Sunset Editorial files are all either {{PD-US-defective notice}} or {{PD-US-defective notice-1978-89}}. The notices are defective because according to the linked blog, all SSE ever got was a credit line. The company was no longer active by 1989, and I checked and there are no copyright registrations under SSE's name. The publication years of the sound effects however, are unknown, so I plan on tagging everything with PD-US-defective notice-1978-89.

Description

This is a set of audio files by the University of Southern California and Sunset Editorial consisting of the original recordings of sound effects used in movies from the 60s to 80s; a few of these sound effects are very famous (like the Wilhelm Scream). This file conveniently maps all the sound effects with a metadata .csv file with descriptions and upload dates and everything, so setting up a batch upload isn't too difficult. I'm prepared to do this upload myself.

Snowmanonahoe (talk) 02:21, 27 May 2023 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

NIH Bioart

These biology icons are very useful, please upload them all.

Source to upload from

https://bioart.niaid.nih.gov/

License

Description

They can be used in medical diagrams and various other information graphics and are of high importance because not only are such often used in important health education but also Biorender icons can't be used due to their licensing and these icons could be used as an alternative or even to substitute these.
Please add them to Category:NIH BioArt further categories in addition that better enable people to find these would also be good. Please make sure to use descriptive titles and descriptions, probably by using the title on the site and its description and keywords.
The website doesn't load properly on my end. Don't know why that is but it makes it even more important to upload these icons here and integrate it into this large media repository.

Prototyperspective (talk) 15:15, 16 May 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Wikimedia Codex icons

Source to upload from

Tracked in Phabricator
Task T401186

Tracked in Phabricator
Task T361497

The phab task going to take forever, but putting it here in case the batch upload takes longer.

gerrit.wikimedia.org/r/plugins/gitiles/design/codex/+/refs/heads/main/packages/codex-icons/src/images/

into Category:Wikimedia Codex icons

License

gerrit.wikimedia.org/r/plugins/gitiles/design/codex/+/refs/heads/main/packages/codex-icons/LICENSE LICENSE

MIT License

Description

All in the same path gerrit.wikimedia.org/r/plugins/gitiles/design/codex/+/refs/heads/main/packages/codex-icons/src/images/, also see [[15]]
Does the site have an API? gerrit.wikimedia.org/r/plugins/gitiles/design/codex/+/refs/heads/main/README.md and gerrit.wikimedia.org/r/plugins/gitiles/design/codex/+/refs/heads/main/packages/codex-icons/BUILDING.md
What else could ease uploading? mw:Manual:Coding conventions/SVG
Did you contact the site owner? No.
Is there a template that could be used on the file description pages, or should one be created? {{Cc-by-4.0}}

Waddie96 (talk) 20:38, 4 August 2025 (UTC)[reply]

Asked for help at Commons:Bots/Work requests#c-Waddie96-20250805100500-Upload Codex icon set 250 svg's. (Asked for help with template at phab:T101666 to use CSS instead of this method, oh well.) Waddie96 (talk) 10:07, 5 August 2025 (UTC)[reply]

Please ping any replies :) Waddie96 (talk) 10:46, 5 August 2025 (UTC)[reply]

Half done

@M5: Good day! Thanks for batch uploading Category:Wikimedia Codex icons! I really appreciate it. Per my request at Commons:Batch uploading/Wikimedia Codex icons and Commons:Bots/Work requests#c-Waddie96-20250805100500-Upload Codex icon set 250 svg's ,

Is there any chance you could upload all the same icons again, but each time in a different subfolder with a different color applied?

- @color-base @color-subtle @color-placeholder @color-notice @color-error @color-warning @color-success
- (per the Codex design guide,: wmdoc:codex/latest/componenxts/demos/icon.html)
- (do not upload different sizes, per image policy we upload the largest which is @size-125 or 20px)

Waddie96 (talk) 21:11, 5 August 2025 (UTC)[reply]

@Waddie96 Hi can you check if I applied the colors correctly here before I upload anything https://files.catbox.moe/2u15nu.zip

999

REAL 💬 ⬆ 10:23, 22 August 2025 (UTC)[reply]

@999real Please let me know how you did this so fast???? I can teach myself if you don't have the time, but what application?

Thanks so much! But just the colors that are wrong. See doc.wikimedia.org/codex/latest/components/demos/icon.html#color icon component #colors, if you convert the color tokens at prev page, using their color tokens list into HEX then the icons are colored as follows:

base == #202122
subtle == #54595d
notice == #404244
error == #f54739
warning == #ab7f2a
success == #099979

Thanks so much! Waddie96 (talk) 11:49, 22 August 2025 (UTC)[reply]

And is there any chance you could somehow move the old base (black) icons in root of Category:Wikimedia Codex icons into an old icons category, then I'll move all instances that they're getting used to the new base icons, and then request for the old ones to be deleted. :-) Waddie96 (talk) 11:50, 22 August 2025 (UTC)[reply]

I just use find and replace in text editor and I moved them in Category:Black_Wikimedia_Codex_icons

999

REAL 💬 ⬆ 12:21, 22 August 2025 (UTC)[reply]

OMG! And the license template is template:Wikimedia Codex! @999real Waddie96 (talk) 11:56, 22 August 2025 (UTC)[reply]

@999real OMG! Thanks so much! You're a star. ~~And I'm so sorry but I made a mistake with the Color-notice. It's #72777d not the other HEX. Is it too much trouble to correct this?~~

Oh so you use a find and replace text editor script/program? Nifty wouldn't have considered the existence of such a thing. Waddie96 (talk) 06:43, 23 August 2025 (UTC)[reply]

Wait scratch that, Color-placeholder and the correct Color-notice are the same HEX Waddie96 (talk) 06:44, 23 August 2025 (UTC)[reply]

@999real Ohhhh, that's what I forgot. I forgot to add the color @progressive which is HEX #36c. Waddie96 (talk) 12:57, 23 August 2025 (UTC)[reply]

Ok I uploaded it too. Sorry I don't understand about color-placeholder, is there something to fix?

999

REAL 💬 ⬆ 13:38, 23 August 2025 (UTC)[reply]

@999real Just the @color-icon-notice I got wrong, it's #72777d and not #404244. But @color-icon-notice is the same hex as @color-placeholder, so we can copy the color-placeholder cat and paste in main dir, and rename the cat and all children to color-notice. And without deleting the old incorrect color-notice. (Sorry for using Color-notice and Color-icon notice interchangeably, but I hope I made sense. You can check out the Codex design style guide for Wikimedia color palette.) Waddie96 (talk) 14:15, 23 August 2025 (UTC)[reply]

Sorry I don't understand properly

If the current color-notice is wrong what are we doing with it? It shouldn't be deleted? I can overwrite all of them if we need to
Are we trying to have both color-icon-notice and color-placeholder with the same color? I don't think we should have duplicates, we could redirect the file names if we need to have 2 with the same color

999

REAL 💬 ⬆ 14:41, 23 August 2025 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Old requests (before 2020-01-01)

Batch uploads in progress

Batch uploads on hold

Done (to be moved to past batch uploads)

Failed

Scripters

Multichill (talk · contribs)
Jarekt (talk · contribs)
Slick (talk · contribs) - no audio/video
Husky (talk · contribs)
DaxServer (talk · contribs)

Currently inactive

TheDJ (talk · contribs)
Duesentrieb (talk · contribs)
Aude (talk · contribs) - including batch audio & video uploads
Basvb (talk · contribs)
Fæ (talk · contribs) - see project list

Tools

See Commons:Upload tools. The Python Wikipedia Bot framework supports image uploads and is particularly versatile.
d:Help:QuickStatements - tool for batch upload of metadata to Wikidata, which can be than accessed by {{Artwork}} and other templates.
Flickrripper allows batch uploading from a set, group or a user id on flickr.

Scripts, Examples and Information

the scripts I using on jobs here and here
a bash script to extract the VRINs on (U.S. military) pictures on commons, can very usefull to find duplicate before upload
Details about 'Zoomify' images and how to get it (in German)
Howto import images from news.kremlin.ru: import news.kremlin.ru news gallery.sh & import news.kremlin.ru photo gallery.sh
Another option so to download the images to your local machine, then upload with Pattypan.