Posts

Showing posts with the label OCR

2025-01-06: 9th Computational Archival Science Workshop Trip Report

Image
The 9th Computational Archival Science Workshop took place in Washington, DC in December, 2024   The 9th Computational Archival Science Workshop , a part of the IEEE Big Data conference, took place on December 17, 2024 in Washington, DC. The hybrid workshop featured publications from students and professors at information science departments, computer science departments, libraries, and business departments. The topics all focused on integrating artificial intelligence with archives, and the presentations and discussions also prominently featured ethics as well. The workshop included 18 papers from 21 institutions. Session 1: Trends in Computational Archival Science To start the workshop, Jennifer Proctor presented her work, " A Computational Review of the Literature of Computational Archival Science (CAS): Advancing Archival Theory in the Age of the Digital Tsunami and the Vanishing Box Problem ." She analyzed all of the previous Computational Archival Science workshop publ...

2024-07-31: Improving Learning for All: How AI-Driven App Converts Scanned Documents to Readable Text for Low Vision Students

Image
Presenting our extended abstract at the 17th MSVSCC 2024 At the 17th Annual Modeling, Simulation, and Visualization Student Capstone Conference 2024 (MSVSCC), I presented our extended abstract, “ AI-Driven App for Accessibility in Education: Converting Scanned Documents to Readable Text for Students with Low Vision. ” With its intuitively designed user interface, this app is crafted to convert scanned documents into a readable format that seamlessly integrates with users’ existing accessibility tools. The app’s workflow is powered by Nougat (Neural Optical Understanding for Academic Documents), an advanced Optical Character Recognition (OCR) model that converts scientific documents into Markdown. The repository with the code can be found here: https://github.com/bllin001/storymodelers-ocr-documents-art-app . Thanks to @storymodelers @WebSciDL @oducs @vmasc_odu for the opportunity to participate for 2nd year. See you next year!!! @Jhon_gbm12 @joseph_mars7 @boxlessmel @Er...

2022-02-16: About the Use of Amazon Rekognition and the Installation of Associated AWS CLI

Image
  Amazon Rekognition is a cloud service for extracting text in images launched by Amazon. It can find the text in an image and recognize it, as well as output other necessary information provided in this image, such as the location of both the image and  the  text. I'd like to share my hands-on experience in installing and working with it in this blog post. It can be used on both Windows and Linux OS. Part 1: Prerequisites Step 1: Sign up to AWS Follow the instructions to sign up for an AWS account .  Step 2: Create an IAM user account Sign in to the  IAM console and set up user and permissions. You can follow these instructions in part ' Step 2: Create an IAM user account '.  The IAM console is shown in the image below. You can add users, create groups, and set up access in this console. Step 3: Create an access key ID and secret access key The access key ID and secret access key are needed for the AWS  CLI (Command Line Interface) access. In th...