PDF extract pointers?

agaziel2 · November 18, 2024, 9:05pm

Hi,

Any pointers on a library to extract text from pdf files that works from a function? pdf-parse and others simply fail. The file is uploaded to a blob, and from there should be pasrsed and passed on for embedding in pinecone. It works fine with other filet types, but it seems as if text extraction from a pdf in a serverless environment is quite a challenge…

Any experience? REcommendations?

my site is on test.riskgpt.io

hrishikesh · November 20, 2024, 12:56pm

Nothing official from Netlify’s end. You might have better luck in asking this in a wider web-dev forum than something Netlify-specific.

Topic		Replies	Views
Hosting a file along with my function Support lambda-functions	34	15929	January 1, 2025
Create downloadable file in Lambda functions? Support lambda-functions	1	959	June 5, 2020
Using pdf-fill-form with native dependencies on netlify function Support lambda-functions	2	622	March 16, 2021
Generate (large) files with a serverless function Support netlify-newbie , netlify-large-media-nlm	5	879	July 26, 2022
Reading static file in function Support lambda-functions	3	1464	November 5, 2020

PDF extract pointers?

Related topics