News: Check out our newer demo at Paper To HTML.

Welcome to SciA11y!

This is an experimental prototype created by Semantic Scholar. It provides access to 1.5M open-access scientific documents in accessible HTML format. Our system uses machine learning techniques to extract the semantic content of scientific papers and formats it in HTML for easier reading. Because the system relies on statistical machine learning, some errors are inevitable. We will continue to improve our models and would love to hear your feedback in the meantime. The papers included in this demo come from a static dataset; all papers have CC (non-ND) licenses and were published in or before April 2020.

You can also upload your own PDF, which we process and render in HTML for reading. You can try this functionality here.

Example papers

Data Security, Privacy, Availability and Integrity in Cloud Computing: Issues and Current Solutions
2016 Sultan Aldossary, William Allen

Application of acidic accelerator for production of pure hydrogen from NaBH4
2014 Wameath S. Abdul-Majeed, Muhammad T. Arslan, William B. Zimmerman

Contribution of Chronic Disease to the Burden of Disability
2011 Bart Klijs, Wilma J. Nusselder, Caspar W. Looman et al.

Scientific Article Summarization Using Citation-Context and Article's Discourse Structure
2017 Arman Cohan, Nazli Goharian

A synthesis of recent analyses of human resources for health requirements and labour market dynamics in high-income OECD countries
2016 Gail Tomblin Murphy, Stephen Birch, Adrian MacKenzie et al.

Internet Access by People with Intellectual Disabilities: Inequalities and Opportunities
2013 Darren Chadwick, Caroline Wesson, Chris Fullwood

Integrated watershed management: evolution, development and emerging trends
2016 Guangyu Wang, Shari Mang, Haisheng Cai et al.

Gd(III) ion-chelated supramolecular assemblies composed of PGMA-based polycations for effective biomedical applications
2015 Yu Zhao, Shun Duan, Bingran Yu et al.

Life cycle assessment of construction and renovation of sewer systems using a detailed inventory tool
2016 Serni Morera, Christian Remy, Joaquim Comas et al.

Multi-domain Neural Network Language Generation for Spoken Dialogue Systems
2016 Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic et al.

Preprint

To find out more about how we created this prototype, please read our preprint. Accessible PDF available here.

Team

Feedback

Please address questions or feedback to Lucy Lu Wang or Jonathan Bragg.