<?xml version='1.0' encoding='utf-8'?>
<eprints xmlns='http://eprints.org/ep2/data/2.0'>
  <eprint id='http://edata.bham.ac.uk/id/eprint/1292'>
    <eprintid>1292</eprintid>
    <rev_number>18</rev_number>
    <documents>
      <document id='http://edata.bham.ac.uk/id/document/7104'>
        <docid>7104</docid>
        <rev_number>3</rev_number>
        <files>
          <file id='http://edata.bham.ac.uk/id/file/39446'>
            <fileid>39446</fileid>
            <datasetid>document</datasetid>
            <objectid>7104</objectid>
            <filename>UnpaywallPDFDownloader_Package.zip</filename>
            <mime_type>application/zip</mime_type>
            <hash>cd5fb7686f915b8b26eb54c6d79157c1</hash>
            <hash_type>MD5</hash_type>
            <filesize>6183</filesize>
            <mtime>2025-05-20 22:04:55</mtime>
            <url>http://edata.bham.ac.uk/1292/1/UnpaywallPDFDownloader_Package.zip</url>
          </file>
        </files>
        <eprintid>1292</eprintid>
        <pos>1</pos>
        <placement>1</placement>
        <mime_type>application/zip</mime_type>
        <format>archive</format>
        <security>public</security>
        <license>cc_by_4</license>
        <main>UnpaywallPDFDownloader_Package.zip</main>
        <content>full_archive</content>
      </document>
    </documents>
    <eprint_status>archive</eprint_status>
    <userid>654</userid>
    <dir>disk0/00/00/12/92</dir>
    <datestamp>2025-05-29 12:54:16</datestamp>
    <lastmod>2025-05-29 14:21:09</lastmod>
    <status_changed>2025-05-29 12:54:16</status_changed>
    <type>data_collection</type>
    <metadata_visibility>show</metadata_visibility>
    <item_issues>
      <item>
        <id>duplicate_title_1300</id>
        <type>duplicate_title</type>
        <description>Duplicate title to &lt;a href=&quot;http://edata.bham.ac.uk/view/creators/Liu=3ALixu=3A=3A.html&quot;&gt;&lt;span class=&quot;person_name&quot;&gt;Liu, Lixu&lt;/span&gt;&lt;/a&gt; (2025). &lt;a href=&quot;http://edata.bham.ac.uk/cgi/users/home?screen=EPrint::View&amp;amp;eprintid=1300&quot;&gt;&lt;xhtml:em xmlns:xhtml=&quot;http://www.w3.org/1999/xhtml&quot;&gt;Research software supporting the publication “Leveraging Large Language Models to Classify and Inspect Defects in Reinforced Autoclaved Aerated Concrete (RAAC)”.&lt;/xhtml:em&gt;&lt;/a&gt; University of Birmingham. DataSet. &lt;a href=&quot;https://doi.org/10.25500/edata.bham.00001300&quot; target=&quot;_blank&quot;&gt;https://doi.org/10.25500/edata.bham.00001300&lt;/a&gt;.</description>
        <timestamp>2025-06-03 01:30:06</timestamp>
        <status>discovered</status>
      </item>
    </item_issues>
    <item_issues_count>1</item_issues_count>
    <creators>
      <item>
        <name>
          <family>Liu</family>
          <given>Lixu</given>
        </name>
        <id>l.liu.9@bham.ac.uk; lixu@verdemetrix.com</id>
        <orcid>0009-0008-8011-496X</orcid>
      </item>
    </creators>
    <title>Research software supporting the publication “Leveraging Large Language Models to Classify and Inspect Defects in Reinforced Autoclaved Aerated Concrete (RAAC)”</title>
    <divisions>
      <item>10col_ephy</item>
    </divisions>
    <keywords>Python, PDF Downloader, Unpaywall API, Open Access, Research Tools, DOI, Academic Software, Scholarly Communication, Automation</keywords>
    <note>The script requires the user to specify their own email address for Unpaywall API access and to adjust file paths for CSV input and download output. It is intended for research use only and should be run with care to avoid excessive requests to external servers. The tool does not circumvent paywalls and only downloads content legally available via Unpaywall. Audio notification may not work in all server environments.
Source code reviewed on 2025-05-20. The script contains no personal or sensitive data and complies with the Unpaywall API usage policy. Repository link included for version tracking.</note>
    <abstract>This dataset contains a Python-based tool that automates the download of open-access full-text PDFs using the Unpaywall API. It accepts a CSV file of DOIs and saves the available PDFs into a structured directory. The project includes source code, a requirements file, and documentation.
It was developed to support open science workflows by simplifying access to freely available scholarly publications.</abstract>
    <date>2025-05-29</date>
    <date_type>published</date_type>
    <publisher>University of Birmingham</publisher>
    <id_number>https://doi.org/10.25500/edata.bham.00001292</id_number>
    <data_type>Software</data_type>
    <contact_email>research-data@contacts.bham.ac.uk</contact_email>
    <department>
      <item>School of Engineering, Department of Civil Engineering</item>
    </department>
    <funders>
      <item>other</item>
    </funders>
    <other_funders>
      <item>Research England</item>
      <item>Imfuna Ltd</item>
    </other_funders>
    <projects>
      <item>C-DICE Industrial Secondment</item>
    </projects>
    <secondary_language>en</secondary_language>
    <data_protection_legislation>yes</data_protection_legislation>
    <funder_reqs>yes</funder_reqs>
    <contractual_reqs>yes</contractual_reqs>
    <university_reqs>TRUE</university_reqs>
    <contact_details>
      <name>
        <family>Liu</family>
        <given>Lixu</given>
      </name>
      <id>l.liu.9@bham.ac.uk; lixu@verdemetrix.com</id>
    </contact_details>
    <related_res_rich>
      <item>
        <title>UnpaywallPDFDownloader</title>
        <res_type>web</res_type>
        <url>https://github.com/lixuliu/UnpaywallPDFDownloader.git</url>
        <rel_type>IsIdenticalTo</rel_type>
      </item>
    </related_res_rich>
    <data_prep_note>The script was developed using Python 3.10 and tested in a local environment. Dependencies were managed using pip and are listed in requirements.txt. The code reads DOIs from a CSV file, queries the Unpaywall API, and downloads open-access PDFs to a local directory. Logging and error handling were implemented to track failed downloads; the DOIs that could not be downloaded are saved separately in rest_articles.csv. A notification sound, played using librosa and sounddevice, indicates when the run is complete.</data_prep_note>
    <language>en</language>
  </eprint>
</eprints>
