Transcript
Current revision as of 30 December 2022, 15:14
A transcript is a reduction of the spoken word to a written form. Some context is lost in the process.
Auto-Transcript
Whisper
Steps:
- Install Python 3.9.13
  - Set up pip (via the system environment variables) so that it points to Python 3.9.13
- Install PyTorch via pip
- Install Whisper via pip
  - pip install git+https://github.com/openai/whisper.git
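To check that the pip found on the PATH really belongs to the intended Python installation, something like the following sketch may help. It assumes that `pip --version` prints its usual `(python X.Y)` suffix; the function names are made up for this note.

```python
import re
import shutil
import subprocess
import sys
from typing import Optional


def python_version_from_pip(output: str) -> Optional[str]:
    """Extract the 'X.Y' Python version from the output of `pip --version`."""
    match = re.search(r"\(python (\d+\.\d+)\)", output)
    return match.group(1) if match else None


def pip_on_path_matches_interpreter() -> bool:
    """Return True if the `pip` on the PATH targets the running interpreter."""
    pip = shutil.which("pip")
    if pip is None:
        return False  # no pip on the PATH at all
    output = subprocess.run(
        [pip, "--version"], capture_output=True, text=True, check=True
    ).stdout
    expected = f"{sys.version_info.major}.{sys.version_info.minor}"
    return python_version_from_pip(output) == expected


if __name__ == "__main__":
    print("Python:", sys.version.split()[0])
    print("pip matches interpreter:", pip_on_path_matches_interpreter())
```

If the check fails, calling pip as `python -m pip install ...` avoids the PATH question, because pip then runs under the interpreter that was invoked.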
https://github.com/openai/whisper/discussions/categories/show-and-tell
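Once the installation steps are done, a transcription run might look roughly like the sketch below. It uses `whisper.load_model` and `model.transcribe` from the Whisper package; the helper names, the model size `"base"` and the file name `interview.mp3` are placeholders chosen for this note, not part of the original setup.

```python
from typing import Iterable, List, Mapping, Optional


def format_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as hh:mm:ss (fractions are dropped)."""
    hours, rest = divmod(int(seconds), 3600)
    minutes, secs = divmod(rest, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def segments_to_lines(segments: Iterable[Mapping]) -> List[str]:
    """Turn Whisper segments into 'hh:mm:ss text' lines."""
    return [
        f"{format_timestamp(seg['start'])} {seg['text'].strip()}"
        for seg in segments
    ]


def transcribe_file(
    path: str, model_name: str = "base", language: Optional[str] = None
) -> List[str]:
    """Transcribe an audio file; language=None lets Whisper detect it."""
    import whisper  # imported here so the helpers above work without it

    model = whisper.load_model(model_name)
    result = model.transcribe(path, language=language)
    return segments_to_lines(result["segments"])


if __name__ == "__main__":
    # "interview.mp3" is a placeholder file name.
    for line in transcribe_file("interview.mp3", language="de"):
        print(line)
```

Whisper also needs the ffmpeg command-line tool on the PATH to read audio files.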
Word
300 minutes / month max.
Word to wiki conversion
Regex
Find:
0
Replace:
|-
| 0
Find:
Speaker
Replace:
||
Find:
Transcript
Replace:
{| class="wikitable"
|+ Transcript
|-
! Timestamp !! Speaker !! Text
Find:
Replace:
||
Find:
||
|-
Replace:
|-
Find:
||
Replace:
|}
Find (enable regex):
(?<=\d\d) \|\|\n
Replace:
|| ||
Find (enable regex):
^Audio file \|\|$
.+
\|\}\|\|
\{\|
Replace:
{|
Done.
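The find/replace chain above could probably be replaced by one small script. The sketch below is not the procedure from this page. It assumes the Word export consists of an "Audio file" block, a line reading "Transcript", and then blocks of a `hh:mm:ss Speaker` line followed by one or more lines of text. The function name is made up, and the layout should be checked against a real export first.

```python
import re

# Assumed shape of a block header: "00:00:01 Speaker 1" (speaker may be missing).
HEADER = re.compile(r"^(\d{2}:\d{2}:\d{2})\s*(.*)$")


def transcript_to_wikitable(text: str) -> str:
    """Convert an exported Word transcript into MediaWiki table markup."""
    rows = []  # each row is [timestamp, speaker, spoken text]
    in_transcript = False
    for raw in text.splitlines():
        line = raw.strip()
        if not in_transcript:
            # Skip everything (e.g. the "Audio file" block) up to "Transcript".
            in_transcript = line == "Transcript"
            continue
        if not line:
            continue
        match = HEADER.match(line)
        if match:
            rows.append([match.group(1), match.group(2), ""])
        elif rows:
            # Text lines are appended to the most recent timestamp row.
            rows[-1][2] = (rows[-1][2] + " " + line).strip()

    out = [
        '{| class="wikitable"',
        "|+ Transcript",
        "|-",
        "! Timestamp !! Speaker !! Text",
    ]
    for timestamp, speaker, spoken in rows:
        out.append("|-")
        out.append(f"| {timestamp} || {speaker} || {spoken}")
    out.append("|}")
    return "\n".join(out)
```

A timestamp line without a speaker produces an empty speaker cell, which corresponds to the `|| ||` replacement in the regex steps. Pipe characters inside the spoken text are not escaped in this sketch.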
Sources
- Stack Overflow: better way to automate mouse & keyboard using pyautogui[1]
- Tool: pywinauto
- Sleep vs. tight loop: https://stackoverflow.com/questions/34361728/windows-desktop-gui-automation-using-python-sleep-vs-tight-loop
- Alternatives to Word
  - AssemblyAI https://www.youtube.com/watch?v=rTWM5WuPhlQ
  - pydub, ffmpeg and SpeechRecognition https://pythonbasics.org/transcribe-audio/
  - Automatic Speech Recognition (ASR) - German https://github.com/AASHISHAG/asr-german
  - Whisper https://openai.com/blog/whisper/
- ↑ Stack Overflow: better way to automate mouse & keyboard using pyautogui https://stackoverflow.com/questions/42412157/better-way-to-automate-mousekeyboard-using-pyautogui