
[Documentation]Instructions on how to take your application to production #345


Closed
wants to merge 7 commits into from

Conversation

elvaliuliuliu
Contributor

Currently, customers are asking how they can run a Spark .NET application in different scenarios. This PR gathers the most commonly asked scenarios and provides general instructions on how customers can package their applications and submit jobs in those scenarios.

@imback82
Contributor

cc: @bamurtaugh

@@ -0,0 +1,108 @@
Taking your Spark .Net Application to Production
Member

Do we need to call it ".NET for Apache Spark" or ".NET for Spark" application instead (since we steer away from calling it Spark.NET publicly)? Also, I think ".NET" should be all caps for consistency.

Contributor Author

@elvaliuliuliu Nov 20, 2019

Sure, I will change it to .NET for Apache Spark for now, and keep .NET in all caps. Thanks!

This how-to provides general instructions on how to take your .NET for Apache Spark application to production.
In this documentation, we will summarize the most commonly asked scenarios when running a .NET for Apache Spark application.
You will also learn how to package your application and submit it with [spark-submit](https://spark.apache.org/docs/latest/submitting-applications.html) and [Apache Livy](https://livy.incubator.apache.org/).
- [How to take your application to production when you have single dependency](#how-to-take-your-application-to-production-when-you-have-single-dependency)
Member

Suggested change
- [How to take your application to production when you have single dependency](#how-to-take-your-application-to-production-when-you-have-single-dependency)
- [How to take your application to production when you have a single dependency](#how-to-take-your-application-to-production-when-you-have-a-single-dependency)

Not sure if we can change the phrasing here and still have it be precise, but "a single dependency" might sound a little cleaner.

Member

Alternatively, could we make these headings either more concise or more precise? i.e., either remove the "How to take your application to production" part since that phrase is already in the article title, or add a phrase that more specifically states what it means to take an app to production (does it just mean running spark-submit, so we could say something like "Deploy app with a single dependency"?).

Suggested change
- [How to take your application to production when you have single dependency](#how-to-take-your-application-to-production-when-you-have-single-dependency)
- [Single dependency](#single-dependency)
Suggested change
- [How to take your application to production when you have single dependency](#how-to-take-your-application-to-production-when-you-have-single-dependency)
- [How to deploy your application when you have a single dependency](#how-to-deploy-your-application-when-you-have-a-single-dependency)

Contributor Author

Thanks for your suggestion! I would prefer the second one, which I think is both concise and precise.

```
#### 2. Using Apache Livy
- Please see below as an example of running your app with Apache Livy in Scenario 3 and Scenario 5.
And you should use `"files": ["adl://<cluster name>.azuredatalakestore.net/<some dir>/nugetLibrary.dll"]` in Scenario 4.
Member

Suggested change
And you should use `"files": ["adl://<cluster name>.azuredatalakestore.net/<some dir>/nugetLibrary.dll"]` in Scenario 4.
Additionally, you should use `"files": ["adl://<cluster name>.azuredatalakestore.net/<some dir>/nugetLibrary.dll"]` in Scenario 4.
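For context, the `"files"` setting quoted above would sit inside a complete Livy batch request along these lines. This is only a sketch for Scenario 4: the cluster name, directories, and version placeholders are assumptions to be filled in, and it mirrors the fields of the Scenario 5 example later in this review.

```json
{
  "file": "adl://<cluster name>.azuredatalakestore.net/<some dir>/microsoft-spark-<spark_majorversion.spark_minorversion.x>-<spark_dotnet_version>.jar",
  "className": "org.apache.spark.deploy.dotnet.DotnetRunner",
  "files": ["adl://<cluster name>.azuredatalakestore.net/<some dir>/nugetLibrary.dll"],
  "args": ["dotnet", "adl://<cluster name>.azuredatalakestore.net/<some dir>/mySparkApp.dll", "<app arg 1>", "<app arg n>"]
}
```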

Contributor Author

I have made the changes to resolve all the comments (except a few which need some input). Thanks so much @bamurtaugh for your comments and feedback!

#### Scenario 4. SparkSession code references a function from a NuGet package that has been installed in the csproj
This would be the use case when `SparkSession` code references a function from a NuGet package in the same project (e.g. mySparkApp.csproj).
#### Scenario 5. SparkSession code references a function from a DLL on the user's machine
This would be the use case when `SparkSession` code references business logic (UDFs) in a DLL on the user's machine (e.g. `SparkSession` code in mySparkApp.csproj and businessLogic.dll on a different machine).
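As a sketch, Scenario 5 submitted with spark-submit instead of Livy might look like the following. All paths, the cluster name, and the jar version placeholders are assumptions; the DLL is shipped to executors via `--files`, matching the `"files"` field of the Livy example in this doc.

```
spark-submit \
  --class org.apache.spark.deploy.dotnet.DotnetRunner \
  --master yarn \
  --files adl://<cluster name>.azuredatalakestore.net/<some dir>/businessLogic.dll \
  adl://<cluster name>.azuredatalakestore.net/<some dir>/microsoft-spark-<spark_majorversion.spark_minorversion.x>-<spark_dotnet_version>.jar \
  dotnet adl://<cluster name>.azuredatalakestore.net/<some dir>/mySparkApp.dll <app arg 1> <app arg n>
```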
Member

Why would businessLogic.dll be on a different machine?

Comment on lines +91 to +98
```json
{
  "file": "adl://<cluster name>.azuredatalakestore.net/<some dir>/microsoft-spark-<spark_majorversion.spark_minorversion.x>-<spark_dotnet_version>.jar",
  "className": "org.apache.spark.deploy.dotnet.DotnetRunner",
  "files": ["adl://<cluster name>.azuredatalakestore.net/<some dir>/businessLogic.dll"],
  "args": ["dotnet", "adl://<cluster name>.azuredatalakestore.net/<some dir>/mySparkApp.dll", "<app arg 1>", "<app arg 2>", "...", "<app arg n>"]
}
```
Member

Should just provide the zip example.

Contributor Author

Thanks for your comments! I have resolved all of them in PR #349 (I opened a new PR, #349, because I could not edit this one, and I will close this one soon). Let's discuss and review in the new PR. Thanks for your understanding, and sorry for the inconvenience.

@elvaliuliuliu
Contributor Author

Closing this one and opening a new PR, #349, to move forward from there. Thanks!
