Skip to content

Commit

Permalink
Update README
Browse files Browse the repository at this point in the history
  • Loading branch information
iMicknl committed Oct 16, 2024
1 parent 93c82bd commit a0461c3
Show file tree
Hide file tree
Showing 2 changed files with 7 additions and 3 deletions.
3 changes: 2 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,8 @@ You can run the application locally or deploy it to Azure, such as on Azure Cont

### Prerequisites

#### Azure Resources
This project utilizes several Azure services, requiring an active Azure subscription. The services used include:

- Azure Document Intelligence
- Azure OpenAI, gpt-4o (2024-08-06) [model availability per region](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models?tabs=python-secure#standard-deployment-model-availability)
- Azure Speech (East US, West Europe, and Southeast Asia for Azure HD voices)
Expand Down
7 changes: 5 additions & 2 deletions app/app.py
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,9 @@
from utils.speech import podcast_script_to_ssml, text_to_speech

# TODO user configurable prompts

# TODO user configurable tone / style
# TODO user configurable voice
# TODO user configurable podcast length

st.set_page_config(
page_title="Azure Podcast Generator",
Expand Down Expand Up @@ -53,7 +55,7 @@
f"Processing document: {uploaded_file.name}, type: {uploaded_file.type}"
)

# Convert PDF to Markdown with Document Intelligence
# Convert PDF/image/Word files to Markdown with Document Intelligence
if uploaded_file.type in [
"application/pdf",
"image/png",
Expand Down Expand Up @@ -99,6 +101,7 @@
final_audio = True

status.update(label="Finished", state="complete", expanded=False)

# TODO Calculate costs of end to end solution
st.write("Total podcast costs: $0.00")

Expand Down

0 comments on commit a0461c3

Please sign in to comment.