Unclear, ambiguous, and/or contradictory licensing by public bodies

This topic is to collect examples of unclear, ambiguous, and/or contradictory licensing by public bodies.

:warning: This topic does not cover data portals that are either silent on licensing or subject to clear but problematic licensing terms — problematic relative to open science that is.

This topic is for material provided under muddled legal terms. In particular, the concept of “non‑commercial” use carries no clear definition under law. In other cases, established licenses not designed or intended to cover data are applied to datasets and data portals. And in other cases, the underlying open license is invalidated by specifying additional legal restrictions — in such circumstances, the original license grant is normally withdrawn in its entirety.

The following table provides an overview:

Organization Portal, product, source Local
ECMWF Copernicus products incl ERA5 post 13
Joint Research Centre Energy and Industry Geography Lab  post 6
IIASA IAMC 1.5°C Scenario Explorer
NGFS Scenario Explorer
post 11
International Energy Agency recent IEA reports post 5
International Energy Agency methane abatement tooling post 15
UNFCCC website post 2
World Bank Group energydata.info portal post 414 
UNEP World Conservation
Monitoring Centre
World Database of Protected Areas post 3

For background on the legal issues in a European context:

UNFCCC website

This single posting has been combined from two earlier posting elsewhere on this site by @mrchrisadams. With the first posting added on 22 February 2022. The UNFCCC is the United Nations Framework Convention on Climate Change.

Problem

On the subject of data leakage, this caught my eye.

I can see this page here from the UN — where as part of the IFI project [international financial institutions] there is guidance on listing carbon intensity information energy for appraising investments:

This lists this information here — various standards and guidance:

The terms of service are interesting:

I. Copyright

All use of materials provided on this web site must follow the Terms and Conditions of Use herein, including reproduction or transmission, in whole or in part, in any form or by any means, electronic or mechanical, including photocopying, recording or the use of any information storage and retrieval system. Any exception to this requires permission in writing from the UNFCCC secretariat.

(snip)

All official texts, data and documents, including low resolution webcast transmissions, are in the public domain and may be freely downloaded, copied and printed provided no change to the content is introduced, and the source is duly acknowledged.

Copyright exists for all high-resolution broadcast quality footage and archival audio and video materials of UNFCCC conferences.

In order to find out more about the licensing conditions, find more information here.

Later on though, the page says this in the same terms:

Website usage

A. General

1.Use of this website constitutes agreement with the Terms of Use contained herein.

2.The UNFCCC secretariat (“UNFCCC”) maintains this website (the “Site”) as a courtesy to those who may choose to access the Site (“Users”). The information presented herein is for informative purposes only. The UNFCCC grants permission to Users to visit the Site and to download and copy the information, documents and materials (collectively, “Content”) from the Site for the User’s personal, non-commercial use, without any right to resell or redistribute them or to compile or create derivative works therefrom, subject to the terms and conditions outlined herein, and also subject to more specific restrictions that may apply to specific Content within this Site.

3.The UNFCCC administers this Site. All Content on this Site from the UNFCCC and all Users, whether registered or otherwise, are subject to the present Terms of Use.

I’ve sent an email to the webmaster to ask which ones apply.

Response

Here’s the abridged response from the webmaster at unfccc.int:

The paragraph 2 in our Terms of Use states that any content from our homepage can only be used only for: “personal, non-commercial use, without any right to resell or redistribute them or to compile or create derivative works therefrom”. This applies to all data, documents, etc. which are published on any of our webpages.

I hope this answers your enquiry.

World Database of Protected Areas

This is a recent favourite of mine which I think also falls under this thread’s category:

From their legal section (highlights are my own):

[…]

No Commercial Use

Neither (a) the WDPA Materials, the WD-OECM Materials and the GD-PAME Materials nor (b) any work derived from or based upon the WDPA Materials, the WD-OECM Materials and the GD-PAME Materials (“Derivative Works") may be put to Commercial Use without the prior written permission of UNEP-WCMC.
[…]

I might be overinterpreting this, but wouldn’t that also include any research results dervied with this dataset?

No Sub-licensing or Redistribution of WDPA, WD-OECM and GD-PAME Data

[…]
You may not redistribute the WDPA, WD-OECM and GD-PAME Data contained in the WDPA, WD-OECM and GD-PAME in whole or in part by any means including (but not limited to) electronic formats such as web downloads, through web services, through interactive web maps (including mobile applications) that grant users download access, […]

However what is forbidden is allowed under certain circumstances:

Publishing the WDPA, WD-OECM and GD-PAME

You may publish the WDPA, WD-OECM and GD-PAME Materials in whole or in part, including on-line, providing (a) the WDPA, WD-OECM and GD-PAME Data are not downloadable and […]

From a technical perspective I’m still confused about when making something available to view is not hosting and providing a download (the data is still transferred). It reminds me a bit of the streaming vs. downloading debate which was ended a few years ago.

It also begs the question how obfuscated and difficult I have to make the download process for users to prevent scraping.

World Bank Group energydata.info portal

As the Creative Commons CC‑BY‑4.0 increasingly becomes the de facto standard for public interest datasets, there seems to be accompanying efforts to add additional restrictions. Here is one example:

The World Bank Group (WBG), via its energydata.info portal, offers energy sector datasets worldwide, including for Africa. On the surface, it uses CC‑BY‑4.0 licensing. But if you drill down, you find this legal notice (emphasis added):

Copyright: Solar resource data © 2019 Solargis. The data is published in Global Solar Atlas under a Creative Commons 4.0 Attribution International license, CC BY 4.0 with the following mandatory and binding addition: Any and all disputes arising under this License that cannot be settled amicably shall be submitted to mediation in accordance with the WIPO Mediation Rules 3 in effect at the time the work was published. If the request for mediation is not resolved within forty-five (45) days of the request, either You or the Licensor may, pursuant to a notice of arbitration communicated by reasonable means to the other party refer the dispute to final and binding arbitration to be conducted in accordance with UNCITRAL Arbitration Rules as then in force. The arbitral tribunal shall consist of a sole arbitrator and the language of the proceedings shall be English unless otherwise agreed. The place of arbitration shall be where the Licensor has its headquarters. The arbitral proceedings shall be conducted remotely (e.g., via telephone conference or written submissions) whenever practicable, or held at the World Bank headquarters in Washington DC.

A screenshot of the metadata provides more context:

Those extra conditions are possibly quite sensible, but would nonetheless appear to be in conflict with provision 2.a.5.B in the CC‑BY‑4.0 license text stating “no downstream restrictions”.

What happens legally in this case? Does the CC‑BY‑4.0 license grant collapse? Are the additional restrictions from the WBG unenforceable? Or is this a legal gray zone? My pick would be that the Creative Commons license is withdrawn.

Nor did I sift through the energydata-info portal to see how prevalent this particular licensing practice is. My suspicion is that this is not an isolated case.

International Energy Agency use of CC‑BY‑NC‑ND‑3.0‑IGO

The Creative Commons license used by the International Energy Agency also falls into the camp of inappropriate licensing. Some very recent IEA publications have been dual licensed Creative Commons CC‑BY‑NC‑ND‑3.0‑IGO. This license is not open due to the NC and ND qualifiers. But importantly here, this instrument does not remove any 96/9/EC database protection that might be present. And as such, qualifies as a bizarre and inappropriate choice under which to offer the public numerical information.

JRC’s Energy and Industry Geography Lab

Link: Energy and Industry Geography Lab - European Commission

  • What is “public use”?
  • Which rights to use and publish derivatives / redistribute the datasets do are granted?
  • Does this constitute a license or are just “all right reserved”?

Quick update on this one.

I had a call with the nice people at the UNFCCC to understand the intention of releasing the data, and the likely licensing options.

I came away with the impression that

a) quite a lot of work had gone into doing various kinds of regression analysis to provide per country figures and
b) the intention was indeed for these to be used as widely as possible

Because there’s been quite alot of work (and I’m definitely not a lawyer here), it sounds like thw work probably would trigger the sui generis database rights, meaning the relatively small subset of numbers they published can be relicensed under a permissive license like CC-BY-4.0 or similar:

Any compiler or “maker” of a database (the term used by the Database Directive) who made a qualitatively and/or quantitatively substantial investment in either the obtaining, verification or presentation of the contents automatically attracts a so-called “sui generis database right”.11 This right was newly introduced in 1996 with the Database Directive. The sui generis database right is similar to copyright but is not granted for creativity but for the financial and professional investment made in obtaining, verifying, and presenting the contents. Hence a database that is structured without creativity but was expensive to make is protected under the sui generis database right, while not being subject to copyright.

The sui generis database right grants the database maker the exclusive right of “extraction and/or re-utilization of the whole or of a substantial part”.

Source: Open data for electricity modeling: Legal aspects

I’ve referred to the [Open data for electricity modeling] paper, but if there are any other resources or papers that might help me make the case to have this data released as openly as possible, I’d be very grateful for them :+1:

@mrchrisadams The paper you mentioned is very good. In full:

Another option is a gray publication of mine (that I was asked to upload to zenodo and duly did):

  • Morrison, Robbie (6 February 2022). Which open data license? — Release 06. doi:10.5281/zenodo.5987672. 14 pages. Creative Commons CC‑BY‑4.0 license.

These licensing recommendations will not change if the Data Act currently before the European Parliament is passed in its current form. But the introduction of a Data Producers Right for machine‑generated data will make the need to explicitly open license all the more necessary.

Thanks for your efforts with the UNFCCC. The secretariat is located in Germany and I guess that will determine the choice of law? HTH R.

Thanks for your efforts @mrchrisadams !

Also an update on the UNFCC / WDPA dataset:

I was exchanging emails over the last months with them regarding technical access and licensing. They communicated to me that they hope to be able to make improvements in the future.

I also mentioned the openmod community and that I could probably find a few users in our community should they want to have some user opinions and perspectives during the process.

Hi folks.

Minor update - I asked about the last meeting:

The IFI TWG, at its last meeting in late May, discussed the conditions of use of IFI grid emission factor database by third parties, and the group decided to make a decision at its next meeting to allow time for members, which is scheduled in the 1st week of July.
We will inform you of the outcome after the meeting.

Best regards,

(person at UNFCC IFI TWG )

My guess is that this might establish a precedent for future decisions around licensing for further use.

IIASA scenarios databases

Two scenarios databases maintained by the International Institute for Applied Systems Analysis (IIASA) offer non‑open licensing and provide problematic statements claiming 96/9/EC database protection. The databases are:

  • IAMC (ongoing). IAMC 1.5°C Scenario Explorer hosted by IIASA. Integrated Assessment Modeling Consortium (IAMC). Part of the IPCC SR15 report. IAMC is Integrated Assessment Modeling Consortium.

  • IIASA (ongoing). NGFS Scenario Explorer. International Institute for Applied Systems Analysis (IIASA). Laxenburg, Austria. Open access. NGFS is Network for Greening the Financial System.

Both sites apply a “Public License [that] is adapted from the Creative Commons Attribution 4.0 International Public License”. And both sites claim too that “The scenario ensemble is protected by EU Sui generis database rights”.

That database claim is almost certainly incorrect in light of a 2021 European court ruling on such matters (CJEU 2021). Indeed, the changes to the CC‑BY‑4.0 license text appear designed to support any extant 96/9/EC database rights. As such protection is highly unlikely to apply, the easiest solution would be for IIASA to remove the remark claiming database protection and revert to unmodified CC‑BY‑4.0 license notices. Then both the contents and the database itself would become genuinely open and reusable and the content would be legally compatible with other material under CC‑BY‑4.0 licensing (or alternatively with legal provisions that are inbound compatible).

Just to note I have contacted IIASA in relation to the databases listed above. For some interpretation of the CJEU ruling in 2021, please see:

I feel like unearthing this topic, be it only for therapeutic purposes.

Checking the ECMWF License to Use Copernicus Products, I noticed it is not as open as I had considered it in the past. While it offers products like ERA5 for free-as-in-beer, it imposes the following restrictions that seem incompatible with e.g. CC-BY-4.0:

  • Prescribed attribution note in 5.
  1. Attribution
    5.1. All users of Copernicus Products must provide clear and visible attribution to the Copernicus programme. The Licensee will communicate to the public the source of the Copernicus Products by crediting the Copernicus Climate Change and Atmosphere Monitoring Services:

5.1.1. Where the Licensee communicates or distributes Copernicus Products to the public, the Licensee shall inform the recipients of the source by using the following or any similar notice:

  • ‘Generated using Copernicus Climate Change Service information [Year]’ and/or
  • ‘Generated using Copernicus Atmosphere Monitoring Service information [Year]’.

5.1.2. Where the Licensee makes or contributes to a publication or distribution containing adapted or modified Copernicus Products, the Licensee shall provide the following or any similar notice:

  • ‘Contains modified Copernicus Climate Change Service information [Year]’; and/or
  • ‘Contains modified Copernicus Atmosphere Monitoring Service information [Year]’

where the “or any similar notice” might be leniently considered compatible with CC-BY-4.0 (maybe?). But what is bothering me more about this, is that Copernicus is not the copyright holder, but rather they state that the copyright is held by the European Union.
In my humble understanding that is incompatible?

  • Statement of “Irresponsibility”

5.1.3. Any such publication or distribution
[…] shall state that neither the European Commission nor ECMWF is responsible for any use that may be made of the Copernicus information or data it contains.

Which as a requirement we’ve discussed before compatible with CC-BY.

My Friday evening brain thinks I should stop worrying about licensing again.

In what I believe it being a similar manner, the World Bank also imposes additional licensing terms to it CC-BY 4.0 licenses in the data catalogue.

Hidden behind links, one can find this:

Creative Commons Attribution 4.0 (CC-BY 4.0)

[…] CC-BY 4.0, with the additional terms below, is the default license for all Datasets produced by the World Bank itself and distributed as open data.

All users of these Datasets under the CC-BY 4.0 License also agree to the following mandatory terms: Any and all disputes arising under this License that cannot be settled amicably shall be resolved in accordance with the following procedure:

  1. Pursuant to a notice of mediation communicated by reasonable means by either You or the Licensor to the other, the dispute shall be submitted to non-binding mediation conducted in accordance with rules designated by the Licensor in the copyright notice published with the Work, or if none then in accordance with those communicated in the notice of mediation. The language used in the mediation proceedings shall be English unless otherwise agreed.
  2. If any such dispute has not been settled within 45 days following the date on which the notice of mediation is provided, either You or the Licensor may, pursuant to a notice of arbitration communicated by reasonable means to the other, elect to have the dispute referred to and finally determined by arbitration. The arbitration shall be conducted in accordance with the rules designated by the Licensor in the copyright notice published with the Work, or if none then in accordance with the UNCITRAL Arbitration Rules as then in force. The arbitral tribunal shall consist of a sole arbitrator and the language of the proceedings shall be English unless otherwise agreed. The place of arbitration shall be where the Licensor has its headquarters. The arbitral proceedings shall be conducted remotely (e.g., via telephone conference or written submissions) whenever practicable
1 Like

The IEA is trying its hands on open sourcing models with the methane abatement cost model being available on GitLab, apparently under a GPLv3 license (code) and CC BY SA 4.0 (data).

(Ignoring the fact that you need to register in order to get access to the data, which some of us already don’t consider “open” data)

To me as an amateur the licensing terms look ok-ish, they remind about the need to credit IEA and present a preferred (not a compulsory) attribution note

If you convey the Code and/or share the Datasets, you must give appropriate credit to the IEA.

The IEA’s preferred attribution is:
IEA Methane Abatement Model, IEA, International Energy Agency / Methane / Abatement model / Cost calculation · GitLab,
License: source code: GNU GPL v3.0; or datasets: CC BY SA 4.0.

They then continue with the following additional notes:

Please note the following:

  • if the OECD/IEA cannot settle amicably with you any disputes relating to the model, either you or the OECD/IEA may, pursuant to a notice of arbitration communicated by reasonable means to the other, elect to have the dispute referred to and finally determined by arbitration. The arbitration shall be conducted in accordance with the Arbitration Rules of the United Nations Commission on International Trade Law (UNCITRAL) then in force. The arbitral tribunal shall consist of a sole arbitrator and the language of the proceedings shall be English unless otherwise agreed. The place of arbitration shall be Paris. The arbitral proceedings shall be conducted remotely (e.g., via telephone conference or written submissions) whenever practicable. The parties shall be bound by any arbitration award rendered as a result of such arbitration as the final adjudication of such a dispute; and

  • Nothing in the licensing terms referred to above constitutes or may be interpreted as a limitation upon,
    or waiver of, any privileges and immunities that apply to the OECD/IEA or you, all of which are specifically reserved.

Are notes like this fine from a legal perspective?
While I can understand that they might want to re-emphasize aspects of the license, for me as a user it introduces uncertainty about the license and exact terms.

It give them a “C” for their repository licensing.

Deductions because of
(a) A custom formatted LICENSE file in the code repository, that only states the license and also states the license for the data
(b) the additional notes in the license file,
(c) them not including or linking to a copy of GPLv3 and CC BY SA 4.0,
(d) using CC BY SA 4.0 instead of CC BY 4.0 .

1 Like