Transcript: Difference between revisions
Current revision as of 30 December 2022, 15:14
A transcript reduces the spoken word to a written form. Some context is lost in the process.
Auto-Transcript
Whisper
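Steps:
- Install Python 3.9.13
  - Set up pip (system variables) so that it resolves to the 3.9.13 installation
- Install PyTorch (https://pytorch.org/) via pip
- Install whisper (https://github.com/openai/whisper) via pip
  - pip install git+https://github.com/openai/whisper.git
https://github.com/openai/whisper/discussions/categories/show-and-tell
- To Wiki: https://www.mediawiki.org/wiki/Extension:OAuth/de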
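Once the steps above are done, a transcription run could look roughly like the following sketch. It uses the `whisper.load_model` and `model.transcribe` calls from the Whisper README. The helper names (`format_timestamp`, `format_segments`, `transcribe_file`) and the model size "base" are assumptions made for this illustration, not part of the original notes. Whisper also needs ffmpeg available on the system.

```python
from typing import Iterable, Mapping


def format_timestamp(seconds: float) -> str:
    """Render a position in seconds as HH:MM:SS (fractions are dropped)."""
    total = int(seconds)
    hours, rest = divmod(total, 3600)
    minutes, secs = divmod(rest, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def format_segments(segments: Iterable[Mapping]) -> str:
    """One line per segment: start time, then the stripped segment text."""
    return "\n".join(
        f"{format_timestamp(seg['start'])} {seg['text'].strip()}"
        for seg in segments
    )


def transcribe_file(audio_path: str, model_name: str = "base") -> str:
    """Transcribe one audio file with Whisper and return timestamped lines.

    The model name "base" is only an example choice for this sketch.
    """
    import whisper  # imported lazily so the helpers above work without it

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path)
    # Each entry in result["segments"] carries "start", "end" and "text".
    return format_segments(result["segments"])
```

Calling `transcribe_file("interview.mp3")` (a placeholder file name) would download the chosen model on first use and return one timestamped line per recognised segment.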
Word
Max. 300 minutes per month.
Word to Wiki convert
Regex
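Run the following find/replace pairs in order. "(empty line)" marks a line break inside the search or replacement text.
Find:
(empty line)
0
Replace:
(empty line)
|-
| 0
Find:
Speaker
Replace:
||
Find:
Transcript
Replace:
{| class="wikitable"
|+ Transcript
|-
! Timestamp !! Speaker !! Text
Find:
(empty line)
(empty line)
(empty line)
Replace:
||
(empty line)
Find:
||
|-
Replace:
(empty line)
|-
Find:
||
(empty line)
Replace:
(empty line)
|}
Find (enable regex):
(?<=\d\d) \|\|\n
Replace:
|| ||
(empty line)
Find (enable regex):
^Audio file \|\|$
.+
\|\}\|\|
\{\|
Replace:
{|
#Done.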
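The same conversion can also be sketched as a small script instead of a chain of manual find/replace passes. This is a different technique (a line-by-line parse), not the author's method. It assumes that Word exports each entry as a header line such as "00:00:03 Speaker 1" followed by one or more lines of text, preceded by an "Audio file" / "Transcript" preamble. That layout, the pattern `ENTRY` and the function name `word_to_wikitable` are assumptions for illustration only.

```python
import re

# Assumed entry header, e.g. "00:00:03 Speaker 1" (layout is a guess).
ENTRY = re.compile(r"^(\d{2}:\d{2}:\d{2}) Speaker (\S+)\s*$")


def word_to_wikitable(text: str) -> str:
    """Convert an exported Word transcript into MediaWiki table markup."""
    rows = []  # each row: [timestamp, speaker, list of text lines]
    for line in text.splitlines():
        match = ENTRY.match(line)
        if match:
            rows.append([match.group(1), match.group(2), []])
        elif rows and line.strip():
            # Text that belongs to the most recent timestamped entry.
            rows[-1][2].append(line.strip())
        # Anything before the first entry ("Audio file", the file name,
        # "Transcript") and all empty lines are skipped.

    out = [
        '{| class="wikitable"',
        "|+ Transcript",
        "|-",
        "! Timestamp !! Speaker !! Text",
    ]
    for timestamp, speaker, parts in rows:
        out.append("|-")
        out.append(f"| {timestamp} || {speaker} || {' '.join(parts)}")
    out.append("|}")
    return "\n".join(out)
```

As with the find/replace steps, only the speaker number ends up in the Speaker column. Text that itself contains a "|" character is not escaped in this sketch and would need separate handling before it is pasted into the wiki.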
Sources
- Stack Overflow: better way to automate mouse&keyboard using pyautogui[1]
  - Tool: pywinauto
  - Sleep vs tight-loop: https://stackoverflow.com/questions/34361728/windows-desktop-gui-automation-using-python-sleep-vs-tight-loop
- Alternatives to Word
  - AssemblyAI https://www.youtube.com/watch?v=rTWM5WuPhlQ
  - pydub, ffmpeg and SpeechRecognition https://pythonbasics.org/transcribe-audio/
  - Automatic Speech Recognition (ASR) - German https://github.com/AASHISHAG/asr-german
  - Whisper https://openai.com/blog/whisper/
[1] Stack Overflow: better way to automate mouse&keyboard using pyautogui https://stackoverflow.com/questions/42412157/better-way-to-automate-mousekeyboard-using-pyautogui