GeekTips

Edit a few videos together and add soundtracks in Shotcut (Linux, Mac, Windoze GPL)

98 views18:28

Using LosslessCut (Linux, Mac, Windoze GPL) to make quick editing cuts of mp3s. Trim off the first 22 seconds and last 20 seconds of each file before encoding into a opus chaptered audiobook with freac.

94 views07:40

GeekTips

These are the options that I use for mp3s. But for videos I use SmartCut or keyframe cuts.

90 views07:40

GeekTips

Queue up the files you wish to batch download in Videomass (Linux, Mac, Windoze GPL free) which uses yl-dlp to downlaod from youtube, bitchute, odysee, etc.

89 views18:35

GeekTips

Download all the videos in Videomass

87 views18:36

GeekTips

Queue up all the videos you wish to re-encode to reduce file size keeping 720p.

82 views18:38

GeekTips

-c:v hevc -crf 28 -c:a libopus -b:a 16k -vf scale="-2:720"

The preset I use. - 2 keeps aspect ratio even if you upscale or downscale video. hevc = x265. -crf 28 I use for most videos. -crf 23 for a great documentary or movie and -crf 31 for VHS quality.

83 views18:40

GeekTips

Legogender
Liaspec
Liaspec
Librafeminine
Librafeminine
Libragender
Libragender
Libralesbian
Libralesbian
Libramasculine
Libramasculine
Libramaverique
Libramaverique
Librandrogyne
Librandrogyne
Libranonbinary
Libranonbinary
Lilafluid
Lilafluid
Lingender
Lingender
Linkgender
Littlefluid
Littlefluid
Lolgender
Ludogender
Lunagender
Lunagender
Lunarset
Lunarset

Remove duplicate lines without changing order

nl -w1 gender.txt | sort -k2 | uniq -f1 | sort -n | cut -f2- > output.txt

or this one works too keeping original order

awk '!seen[$0]++' gender.txt > output.txt

Legogender
Liaspec
Librafeminine
Libragender
Libralesbian
Libramasculine
Libramaverique
Librandrogyne
Libranonbinary
Lilafluid
Lingender
Linkgender
Littlefluid
Lolgender
Ludogender
Lunagender
Lunarset

86 viewsedited 20:06

GeekTips

regex searches for and replaces digits up to 13 times after a dash

-
-(\d){13}

77 views22:13

GeekTips

regex searches for a replaces any digits mixed with periods / dots 00.34.77 which are timestamps created by

LosslessCut

you need to use a . instead of using quantifier or whatever it's called. I used 23 or 24 ..... periods.

-(\d)........................

this also works
-(\d+).(\d+).(\d+).(\d+).(\d+).(\d+).(\d+).(\d+)

84 viewsedited 22:16

GeekTips

(\D)-(\d)........................
$1

multiple dashes in filename so (\D) matches non-digit (like abcd, etc.) then a dash and replace with first string $1 which is the letter. Otherwise the last letter gets chopped off at end of filename.

If there is a numeral at the end like Part 1 change the first (\D) to lowercase to indicate digit like so

(\d)-(\d)........................
\1

95 viewsedited 20:02

GeekTips

This book I'm making into an audiobook but the original OCR on the document is pretty much impossible to correct.

pdftotext -layout book.pdf output.txt

91 views21:06

GeekTips

so had to force ocr it

ocrmypdf - -force-ocr book.pdf book_ocr.pdf

and now it's a tad better. Formatting isn't all that important for text to speech.

91 views21:07

GeekTips

Removing hyphens from hyphenated words at the end of a line. Notice for the text to speech to work correctly need to change defi- nitely to definitely and don't change non-partisanship as it's correct as it is.

88 viewsedited 21:44

GeekTips

-$\n\s+
-

is dash
$ says at the end of a line
\n line break
\s is whitespace (blank spaces)
\s+ any amount of whitespace

88 views21:46

GeekTips

It didn't get im- portance nor cir- cumstances as there wasn't any whitespace after the dash -. So search and replace all again using -$\n

91 viewsedited 21:49

GeekTips

now importance and circumstances are correct and non-partisanship isn't changed. Now just have to spell check it before feeding it to ttstool (text to speech)

109 viewsedited 21:50

GeekTips

batch compressed each PDF by about 75%.

made a subdirectory output then

parallel --tag -j 2 ocrmypdf -s -O 2 --skip-big .1 '{}' 'output/{}' ::: *.pdf

Got an OutofMemory Heap error in PDFSam when trying to process a ton of PDFs. So start PDFSam this way with using 2.4GB of memory instead of the 512MB default for java apps.

java -jar -Xmx2400m /opt/pdfsam-basic/pdfsam-basic-4.3.0.jar

As to why PDFSam isn't compressing even with PDF 1.5 checked I have no idea. Thus it's necessary to use ocrmypdf to do the compression.

109 viewsedited 04:19

GeekTips

One PDF had the years 1960 to 1985 and if merged it would have a single entry in the Table of Contents named 1960-1985. I wanted each one from 1960 on to have it's own link in the TOC (table of contents).

Solution was to Split the PDF by Bookmark with PDFSAM.

96 views23:51

GeekTips

To Split by Bookmark choose level 1 and in File names settings right click and choose [BOOKMARK_NAME] and delete PDF_SAM

91 views23:52

GeekTips

Putting spaces between joined capitalized words with regex — renaming files

TheParableofTheFigTree rename it to

The Parable of The Fig Tree

Search and replace all using

([a-z])([A-Z])

replace all with

$1 $2

[a-z] lowercase letters
([a-z]) groups that single letter
[A-Z] uppercase letters
$1 1st string and $2 2nd string

Make sure Case Sensitive Search is checked

103 viewsedited 06:28

About

Blog

Apps

Platform