Skip to content

Instantly share code, notes, and snippets.

View shdwkl's full-sized avatar

shdwkl

  • 我的鸡巴
View GitHub Profile
@shdwkl
shdwkl / vtt2text
Last active March 21, 2025 11:46 — forked from glasslion/vtt2text.py
This script convert youtube subtitle file(vtt) to plain text (with no duplicated).
#!/usr/bin/env python3
import re
import os
import glob
def vtt_to_txt(vtt_filepath, txt_filepath):
"""Converts a VTT file to plain text, removing timestamps, header, and duplicates.
"""
try:
with open(vtt_filepath, 'r', encoding='utf-8') as vtt_file, \
- This page is a collection of some of the Advanced queries from the [[Datalog]] channel on the [[Logseq/Discord]] server. #datalog
id:: 61db13f4-75e8-4f87-ad60-3ac3479c5fc8
- ### Resources
- [link: The first message on the datalog channel](https://discord.com/channels/725182569297215569/743139225746145311/743139795865174119)
- [link: Logseq docs - Advanced queries](https://docs.logseq.com/#/page/advanced%20queries)
- [link: Logseq datascript schema](https://gist.github.com/tiensonqin/9a40575827f8f63eec54432443ecb929)
- [link: Logseq frontend db model](https://github.com/logseq/logseq/blob/master/src/main/frontend/db/model.cljs)
- [link: How to Graph Your Data - talk by Paula Gearon](https://youtu.be/tbVwmFBnfo4)
- [link: Domain modelling with datalog - talk by Norbert Wojtowicz](https://youtu.be/oo-7mN9WXTw)
- [link: Athens Research ClojureFam](https://github.com/athensresearch/ClojureFam)