@cr0n0s @simon I just realized how unhappy I was with just using `ttok` to truncate the transcript to fit the model. I wrote a Python script to strip out the garbage instead.
I went from 100k+ tokens down to `25626` with no loss of context!
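For anyone curious, a minimal sketch of that kind of cleanup (the actual script may differ; this assumes the bloat is the usual WebVTT header, timestamp lines, inline `<c>`/timing tags, and the duplicated lines YouTube auto-captions emit):
```
import re
import sys

def strip_vtt(text: str) -> str:
    """Strip WebVTT cruft and collapse duplicated auto-caption lines."""
    out = []
    last = None
    for line in text.splitlines():
        line = line.strip()
        # Skip the header, metadata, timing lines, and blanks
        if not line or line.startswith(("WEBVTT", "Kind:", "Language:")):
            continue
        if "-->" in line:
            continue
        # Remove inline tags like <00:00:01.500> and <c>...</c>
        line = re.sub(r"<[^>]+>", "", line).strip()
        if not line:
            continue
        # Auto-captions repeat each line in consecutive cues; keep one copy
        if line == last:
            continue
        last = line
        out.append(line)
    return "\n".join(out)

if __name__ == "__main__":
    sys.stdout.write(strip_vtt(sys.stdin.read()))
```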
@cr0n0s
I don't know if this is useful, but I use it a lot.
```
# Download only the auto-generated English subtitles and capture the .vtt filename
set -l vttfilename (yt-dlp --write-auto-sub --skip-download -o '%(id)s.%(ext)s' 'https://www.youtube.com/watch?v=IuF0GlO2Myk' 2>&1 | rg "Destination: " | rg -o '[a-zA-Z0-9_-]+\.en\.vtt')
# Truncate to 120k tokens, then have the model rewrite the captions as prose
cat $vttfilename | ttok -m gpt-4 -t 120000 | llm -m 4o 'convert this vtt file to readable prose'
```
This requires tooling from @simon: `ttok` and `llm`.
Also, this is fish shell.
Hope it's useful!