Notebook usage BPA #178

Open
natolira opened this issue Sep 30, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@natolira

Semantic-link-labs is great, with its BPA for semantic models and reports plus several other user-friendly functions. Thanks Kovalsky!

What I'm seeing in a large corporation is a fear of adopting Fabric because of the high consumption that some poorly written notebooks can cause.

And in many companies regular users don't have admin rights to the workspace. So we need a way to programmatically check/update workspace settings to reflect the best practices.

I can think of several ways to enforce best practices and avoid this high consumption (especially because there are many inexperienced notebook writers, which may lead to poor code):

- Set the default pool for the workspace in the Spark settings to the single-node starter pool

- Turn on high concurrency mode for notebooks and data pipelines

- Update the runtime to the most recent GA version (this is not updated automatically)

- Check the definitions of data pipelines to find multiple notebook calls (especially within loops) and force them to share the same tag so they execute under the same session

- Identify multiple standard sessions started by the same user, or sessions that timed out (to notify those users to use a high concurrency session or to remember to stop the session manually when done)
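
To make the first three items concrete, here is a minimal sketch of a settings check. It assumes the workspace Spark settings have already been fetched into a plain dict; the key names (`pool`, `defaultPool`, `starterPool`, `maxNodeCount`, `highConcurrency`, `environment`, `runtimeVersion`) are my assumption about that payload's shape, and the policy constants and the `check_spark_settings` name are hypothetical, not part of semantic-link-labs.

```python
from __future__ import annotations

from typing import Any

# Hypothetical policy values; adjust them to your own guardrails.
EXPECTED_DEFAULT_POOL = "Starter Pool"
MAX_STARTER_NODES = 1
MINIMUM_RUNTIME = "1.3"


def _version_tuple(version: str) -> tuple[int, ...]:
    """Turn '1.3' into (1, 3) so versions compare numerically."""
    return tuple(int(part) for part in version.split("."))


def check_spark_settings(settings: dict[str, Any]) -> list[str]:
    """Return one message per guardrail the settings dict violates.

    The dict layout is an assumed shape, not a documented contract.
    """
    findings: list[str] = []

    pool = settings.get("pool", {})
    if pool.get("defaultPool", {}).get("name") != EXPECTED_DEFAULT_POOL:
        findings.append("default pool is not the starter pool")
    if pool.get("starterPool", {}).get("maxNodeCount", 0) > MAX_STARTER_NODES:
        findings.append("starter pool allows more than one node")

    concurrency = settings.get("highConcurrency", {})
    if not concurrency.get("notebookInteractiveRunEnabled", False):
        findings.append("high concurrency is off for notebooks")
    if not concurrency.get("notebookPipelineRunEnabled", False):
        findings.append("high concurrency is off for pipelines")

    runtime = settings.get("environment", {}).get("runtimeVersion")
    if runtime is None or _version_tuple(runtime) < _version_tuple(MINIMUM_RUNTIME):
        findings.append("runtime is older than the minimum allowed version")

    return findings
```

Running something like this per workspace and collecting the non-empty results would give a BPA-style report of workspaces that drift from the guardrails.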

And I'm sure there are other actions that could guarantee every workspace respects these guardrails.

Thanks again!

@natolira natolira added the enhancement New feature or request label Sep 30, 2024
@m-kovalsky
Collaborator

Thanks for your feedback! I believe that pretty much all of these can be checked programmatically via native functions (which call the relevant APIs) within semantic link labs. The one missing is data pipeline definitions (which is coming in the next release).
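
For the pipeline-definition case, a rough sketch of the kind of check being discussed might look like the following. It assumes the pipeline definition is already available as a parsed JSON dict, that notebook activities have type `TridentNotebook` with an optional `sessionTag` under `typeProperties`, and that loop activities nest their children under `typeProperties.activities`. Those names are assumptions about the definition format, and `find_untagged_notebooks` is a hypothetical helper, not a function in semantic link labs.

```python
from __future__ import annotations

from typing import Any, Iterator

# Assumed activity type names; verify against a real pipeline definition.
NOTEBOOK_TYPE = "TridentNotebook"
LOOP_TYPES = {"ForEach", "Until"}


def iter_activities(
    activities: list[dict[str, Any]], inside_loop: bool = False
) -> Iterator[tuple[dict[str, Any], bool]]:
    """Yield every activity, including nested ones, with a flag for loops."""
    for activity in activities:
        yield activity, inside_loop
        nested = activity.get("typeProperties", {}).get("activities", [])
        is_loop = activity.get("type") in LOOP_TYPES
        yield from iter_activities(nested, inside_loop or is_loop)


def find_untagged_notebooks(definition: dict[str, Any]) -> list[tuple[str, bool]]:
    """Return (activity name, is inside a loop) for notebooks with no session tag."""
    activities = definition.get("properties", {}).get("activities", [])
    return [
        (activity.get("name", "<unnamed>"), in_loop)
        for activity, in_loop in iter_activities(activities)
        if activity.get("type") == NOTEBOOK_TYPE
        and not activity.get("typeProperties", {}).get("sessionTag")
    ]
```

This only reports notebook calls that have no tag at all; checking that all calls in a pipeline share the same tag would be a small extension of the same traversal.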
