update,

2025-02-01 02:03:59 +08:00
commit ba549854bd
6 changed files with 94 additions and 0 deletions
--- a/.gitattributes
+++ b/.gitattributes
@@ -0,0 +1,31 @@
+*.mp4 filter=lfs diff=lfs merge=lfs
+*.zip filter=lfs diff=lfs merge=lfs
+*.7z filter=lfs diff=lfs merge=lfs
+*.tar.gz filter=lfs diff=lfs merge=lfs
+*.jpg filter=lfs diff=lfs merge=lfs
+*.png filter=lfs diff=lfs merge=lfs
+*.avif filter=lfs diff=lfs merge=lfs
+*.webm filter=lfs diff=lfs merge=lfs
+*.mkv filter=lfs diff=lfs merge=lfs
+
+# Documents
+*.doc	diff=astextplain
+*.DOC	diff=astextplain
+*.docx	diff=astextplain
+*.DOCX	diff=astextplain
+*.dot	diff=astextplain
+*.DOT	diff=astextplain
+*.pdf	diff=astextplain
+*.PDF	diff=astextplain
+*.rtf	diff=astextplain
+*.RTF	diff=astextplain
+
+*.gif filter=lfs diff=lfs merge=lfs
+*.GIF filter=lfs diff=lfs merge=lfs
+*.bmp filter=lfs diff=lfs merge=lfs
+*.BMP filter=lfs diff=lfs merge=lfs
+*.tiff filter=lfs diff=lfs merge=lfs
+*.TIFF filter=lfs diff=lfs merge=lfs
+*.wav filter=lfs diff=lfs merge=lfs
+*.WAV filter=lfs diff=lfs merge=lfs
+*.log filter=lfs diff=lfs merge=lfs
--- a/.gitignore
+++ b/.gitignore
@@ -0,0 +1 @@
+**/~*.*
--- a/gitUpdate.bat
+++ b/gitUpdate.bat
@@ -0,0 +1,7 @@
+git status .
+
+@pause
+
+git add .
+git commit -m"update majkp23,"
+start git push
--- a/gitUpdate.sh
+++ b/gitUpdate.sh
@@ -0,0 +1,16 @@
+#!/usr/bin/env bash
+
+set -ex
+
+git config --global http.version HTTP/1.1
+git config --global lfs.allowincompletepush true
+git config --global lfs.locksverify true
+git config --global http.postBuffer 5368709120
+
+git add .
+
+git diff --cached --quiet || git commit -m 'update,'
+
+git push
+
+echo "done"
--- a/notes.md
+++ b/notes.md
@@ -0,0 +1,36 @@
+### How it works
+
+1. It's similar to ChatGPT, but simpler. Users can sign in, sign out, and store their conversations with the LLM.
+   Q: Should a 3rd-party auth provider (e.g. Google auth) be supported?
+
+2. Important: It must be able to use the Azure API for GPT, because OpenAI is closing access for Hong Kong next week.
+
+3. You can have a conversation like in ChatGPT. You can type your message or dictate it into a microphone; dictated audio is sent to the Whisper API (Azure supports it).
+4. The answer is received from the API, with attached audio generated by the ElevenLabs API.
+   It needs to detect the language, so if the answer is in Spanish, the audio will be in Spanish;
+   if Japanese, Japanese audio, etc. You can continue the conversation, so it keeps the context of the conversation for the next answer.
+5. Audio is looped in the browser by default using the `loop` attribute of the HTML `<audio>` element.
+
+   Remarks: LangChain can be used, but the answer from the AI API will be provided as is for the APIs below:
+
+   - Azure API,
+   - Whisper API,
+   - ElevenLabs API
+
+6. Answers and their audio are stored on the server and can be retrieved via a REST API as well as from this simple web UI.
+   Remarks: A ChatGPT-like UI will be provided.
+   Q: Is there a UI example for the audio part? For the REST API, is any server provided (e.g. storage)?
+
+7. You can use any server framework, such as Node.js; the same goes for the frontend (React, for instance).
+   Q: Will your side cover the related Azure and API fees during the development/bug-fix stage?
+
+### Details needed from a contractor
+
+1. Name and surname
+2. HKID number
+   Q: Is it OK to provide only the first 4 digits of the HKID number (including the English letter)?
+
+Q: Sorry to ask so abruptly, but is this a homework assignment?!
+
+What is the level of completion of your project (POC)?
+Is there any chance I could leave you my company contact instead of my HKID?
--- a/spec.png
+++ b/spec.png
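
Note on notes.md, steps 3 to 5: the conversation turn described there (typed or dictated input, answer from the GPT API, language detection, then audio in the detected language) could be sketched roughly as below. This is only an illustration under stated assumptions, not the project's implementation: every name in it (`Services`, `runTurn`, `TurnResult`, and the four service methods) is hypothetical, and the real Whisper, Azure, and ElevenLabs calls are hidden behind an interface rather than shown.

```typescript
// Hypothetical sketch: none of these names come from notes.md or from any SDK.
type Message = { role: "user" | "assistant"; content: string };

// The three external APIs from the notes are abstracted behind this interface,
// so the flow can be read (and tested) without real network calls.
interface Services {
  transcribe(audio: Uint8Array): Promise<string>; // assumed to wrap the Whisper API
  complete(history: Message[]): Promise<string>; // assumed to wrap GPT on Azure
  detectLanguage(text: string): Promise<string>; // assumed to return a code such as "es"
  synthesize(text: string, language: string): Promise<Uint8Array>; // assumed to wrap ElevenLabs
}

interface TurnResult {
  answer: string;
  language: string;
  audio: Uint8Array;
  history: Message[]; // updated context to pass into the next turn
}

async function runTurn(
  services: Services,
  history: Message[],
  input: string | Uint8Array,
): Promise<TurnResult> {
  // Step 3: typed text is used directly; dictated audio is transcribed first.
  const userText =
    typeof input === "string" ? input : await services.transcribe(input);
  const withUser: Message[] = [...history, { role: "user", content: userText }];

  // Step 4: the answer is generated from the whole conversation so far.
  const answer = await services.complete(withUser);

  // Step 4 (continued): the audio follows the language of the answer.
  const language = await services.detectLanguage(answer);
  const audio = await services.synthesize(answer, language);

  return {
    answer,
    language,
    audio,
    history: [...withUser, { role: "assistant", content: answer }],
  };
}
```

Returning the updated `history` from each turn is one simple way to keep the conversation context for the next answer; storing answers and audio on the server (step 6) would sit outside this function.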