Transcript: Difference between revisions
Current revision as of 30 December 2022, 15:14
A transcript is a reduction of the spoken word to written form. Some context is lost in the process.
Auto-Transcript
Whisper
Steps:
- Install Python 3.9.13
  - Set up pip (via the system environment variables) so that it resolves to the 3.9.13 installation
- Install PyTorch via pip
- Install Whisper via pip
  - pip install git+https://github.com/openai/whisper.git
https://github.com/openai/whisper/discussions/categories/show-and-tell
- To Wiki: https://www.mediawiki.org/wiki/Extension:OAuth/de
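The installation steps above can be checked from Python before the first transcription. The following is a minimal sketch, not part of the original notes: the helper names `missing_packages` and `transcribe` and the model size "base" are assumptions chosen for illustration. Only `whisper.load_model` and `model.transcribe` are taken from the Whisper package itself.

```python
import importlib.util
import sys


def missing_packages(names=("torch", "whisper")):
    """Return the names from `names` that cannot be imported by this interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def transcribe(path, model_name="base"):
    """Transcribe an audio file with Whisper and return its list of segments.

    Hypothetical helper: the model size "base" is an assumption.
    """
    import whisper  # imported lazily so the check above also works without it

    model = whisper.load_model(model_name)
    result = model.transcribe(path)
    # Each segment is a dict that includes "start", "end" and "text".
    return result["segments"]


if __name__ == "__main__":
    print("Python", sys.version.split()[0])
    print("Missing packages:", missing_packages())
```

If the list of missing packages is empty, `transcribe("interview.mp3")` (a placeholder file name) should return the timestamped segments. Whisper relies on ffmpeg being available on the system to read audio files.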
Word
Max. 300 minutes per month.
Word to Wiki convert
Regex
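"(line break)" marks a line break that is part of the pattern.
Find:
(line break)
0
Replace:
(line break)
|-
| 0
Find:
Speaker
Replace:
||
Find:
Transcript
Replace:
{| class="wikitable"
|+ Transcript
|-
! Timestamp !! Speaker !! Text
Find:
(consecutive line breaks)
Replace:
||
(line break)
Find:
||
|-
Replace:
(line break)
|-
Find:
||
(line break)
Replace:
(line break)
|}
Find (enable regex):
(?<=\d\d) \|\|\n
Replace:
|| ||
(line break)
Find (enable regex):
^Audio file \|\|$
.+
\|\}\|\|
\{\|
Replace:
{|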
#Done.
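As an alternative to the manual find-and-replace chain above, the same wikitable can be built in one pass. This is a sketch of a direct conversion, not the replace chain itself. It assumes that each entry in the Word export starts with a line such as "00:00:03 Speaker 1", followed by one or more lines of text. That input format, the function name `to_wikitable` and the sample data are assumptions. Unlike the chain above, it keeps the full speaker label in the Speaker column.

```python
import re

# Assumed header line: a timestamp, whitespace, then the speaker label,
# for example "00:00:03 Speaker 1".
HEADER = re.compile(r"^(\d{2}:\d{2}(?::\d{2})?)\s+(.+)$")


def to_wikitable(transcript: str) -> str:
    """Convert a Word-style transcript into MediaWiki table markup."""
    rows = []  # each row: [timestamp, speaker, list of text lines]
    for line in transcript.splitlines():
        line = line.strip()
        if not line:
            continue  # blank separator lines carry no content
        match = HEADER.match(line)
        if match:
            rows.append([match.group(1), match.group(2), []])
        elif rows:
            rows[-1][2].append(line)
        # Lines before the first timestamp ("Audio file", "Transcript") are skipped.

    out = [
        '{| class="wikitable"',
        "|+ Transcript",
        "|-",
        "! Timestamp !! Speaker !! Text",
    ]
    for timestamp, speaker, parts in rows:
        out.append("|-")
        out.append(f"| {timestamp} || {speaker} || {' '.join(parts)}")
    out.append("|}")
    return "\n".join(out)


if __name__ == "__main__":
    sample = (
        "Audio file\ninterview.mp3\n\nTranscript\n"
        "00:00:03 Speaker 1\nHello there.\n"
        "00:00:07 Speaker 2\nHi.\n"
    )
    print(to_wikitable(sample))
```

A text line that itself begins with a timestamp-like token would be mistaken for a new entry, and a "|" inside the text would need escaping before the result is pasted into the wiki.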
Sources
- Stack Overflow: better way to automate mouse&keyboard using pyautogui [1]
- Tool: pywinauto
- Sleep vs tight-loop: https://stackoverflow.com/questions/34361728/windows-desktop-gui-automation-using-python-sleep-vs-tight-loop
- Word alternatives
  - AssemblyAI https://www.youtube.com/watch?v=rTWM5WuPhlQ
  - pydub, ffmpeg and SpeechRecognition https://pythonbasics.org/transcribe-audio/
  - Automatic Speech Recognition (ASR) - German https://github.com/AASHISHAG/asr-german
  - Whisper https://openai.com/blog/whisper/
[1] Stack Overflow: better way to automate mouse&keyboard using pyautogui https://stackoverflow.com/questions/42412157/better-way-to-automate-mousekeyboard-using-pyautogui