If you, like me, resent every dollar spent on commercial PDF tools,
you might want to know how to change the text content of a PDF without
having to pay for Adobe Acrobat or another PDF tool. I didn't see an
obvious open-source tool that lets you dig into PDF internals, but I
did discover a few useful facts about how PDFs are structured that
I think may prove useful to others (or myself) in the future. They
are recorded here. They are surely not universally applicable --
the PDF standard is truly Byzantine -- but they worked for my case.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/usr/bin/env python3 | |
""" | |
Human quality transcripts from audio files using | |
AssemblyAI for transcription and Google's Gemini for enhancement. | |
Requirements: | |
- AssemblyAI API key (https://www.assemblyai.com/) | |
- Google API key (https://aistudio.google.com/) | |
- Python packages: assemblyai, google-generativeai, pydub |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
"""You help triaging user requests. Given a raw text input, output either DOCS or DEFAULT, according to those definitions: | |
DOCS: if user is asking a seemingly technical question, programming questions or company-specific questions | |
DEFAULT: if user is just chit-chatting or basic knowledge questions | |
==================== | |
Input: hello there | |
Output: DEFAULT |