feat: add PDF Keyword Highlighter script (closes #478)#522
Open
SurfyPenguin wants to merge 3 commits intowasmerio:mainfrom
Open
feat: add PDF Keyword Highlighter script (closes #478)#522SurfyPenguin wants to merge 3 commits intowasmerio:mainfrom
SurfyPenguin wants to merge 3 commits intowasmerio:mainfrom
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
PR Title
feat: add PDF Keyword Highlighter script (closes #478 )
Summary
Added a new command-line Python script that highlights specified keywords in PDF files using PyMuPDF, complete with a dedicated folder, README, and entry in the main repository README.
Description
This pull request implements a fully featured PDF keyword highlighter as requested in issue #478, creating a new highlighted output file while keeping the original unchanged.
The changes are as follows:
Highlighter Script/withpdf_highlight.pyand aREADME.mdpage.get_text("words")for fast text extraction-sflag), and punctuation stripping for accurate matching (e.g.,"keyword;"matches"keyword")README.mdto add the new script entry in alphabetical orderChecks
in the repository
in the PR
Thank You,
Amartya Anand