#78. Azure Data Factory - Execute Python script from ADF

แชร์
ฝัง
  • เผยแพร่เมื่อ 25 ธ.ค. 2024

ความคิดเห็น • 69

  • @azurelearner4055
    @azurelearner4055 2 ปีที่แล้ว +2

    Great!! Best videos on ADF covering almost all the concepts and are very easy to understand

  • @robertwiley2957
    @robertwiley2957 2 ปีที่แล้ว +1

    This was a great overview! I was thinking about this today and was happy to find your overview. It's a great starting point. Thank you!

  • @stephenmartin6995
    @stephenmartin6995 ปีที่แล้ว

    Very informative. I am currently attempting to execute in-process python to alter a files contents. This is very helpful.

    • @AllAboutBI
      @AllAboutBI  ปีที่แล้ว

      Glad it's of use stephen

  • @metechieandsachinfan768
    @metechieandsachinfan768 4 ปีที่แล้ว +2

    Thanks for a wonderful explanation. Indeed a great flow

  • @thepakcolapcar
    @thepakcolapcar ปีที่แล้ว

    Hello Madam
    Good informational video
    Does setting up of pool and batch service adds cost to azure billing?
    Does batch pool resources are counted only when a python script from ADF is run or is it billed for all the time it is there even if no adf batch executions are invoked?

    • @AllAboutBI
      @AllAboutBI  ปีที่แล้ว

      From the moment you spin up a vm charges start to accumulate even if the nodes are idle.

    • @thepakcolapcar
      @thepakcolapcar ปีที่แล้ว

      @@AllAboutBI
      Thank you very much for the quick response. I wish azure allowed using existing VM already set up for some other activities. This is where all these cloud providers try get revenue 🙂
      So its just the VM gets charged or are there any charges such as pool etc? I am just asking as to get an idea, the info need not be accurate.

  • @ankitpancholi5739
    @ankitpancholi5739 3 ปีที่แล้ว +2

    Good video. Very clear. However when executing these steps I am getting an error
    {"errorCategory":0,"code":"CommandProgramNotFound","message":"The specified command program is not found","details":[{"Name":"Message","Value":"The system cannot find the file specified."}]}
    Unable to run the program. Any pointers please. stderr.txt is empty.

    • @AllAboutBI
      @AllAboutBI  3 ปีที่แล้ว

      Pls watch latest video 114 I have given steps

    • @olegserdyuk5082
      @olegserdyuk5082 ปีที่แล้ว +2

      @@AllAboutBI it is not that easy to find video 114. Can you post a full name or the link?

  • @magdhalsmith2724
    @magdhalsmith2724 2 ปีที่แล้ว

    maam in order run python script in ADF using custom activity, should we enable anything?

  • @thepakcolapcar
    @thepakcolapcar ปีที่แล้ว

    after setting up the batch account and all other details, when i run the pipeling the custom activity errors out saying "Cannot create Shared Access Signature unless Account Key credentials are used." Any clue might help. Thanks

    • @AllAboutBI
      @AllAboutBI  ปีที่แล้ว

      What are you trying to do with the batch job.. if u can send details to funlearn0007@gmail.com i will see if I know something

  • @adidasravi
    @adidasravi 3 ปีที่แล้ว

    Hello madam. Had a question. I have a customized job created in batch service but how can I make adf to make use of that job. Eventhough I've created the job for the pool adf uses, the adf creates its own new job and uses it. I want adf to make use of the existing job in pool. Please let me know if you've any idea

  • @tiananmensquareentertainme7361
    @tiananmensquareentertainme7361 3 ปีที่แล้ว

    Hello madam. Is there any automated way to down scale and up scale the dedicated nodes in a pool? This would help reduce the azure cost. Please let me know your thoughts

  • @nandanagurrapu6201
    @nandanagurrapu6201 3 ปีที่แล้ว +1

    Hi
    I have tried this example of executing python script. I have used same script as yours, but the pipeline failed saying
    The system can not find file specified
    Any help on this.
    Thanks

    • @shilpashilpa2099
      @shilpashilpa2099 3 ปีที่แล้ว +1

      Hi Nandana, were you able to fix this error? I'm also getting same error...can you please help?

    • @AllAboutBI
      @AllAboutBI  2 ปีที่แล้ว

      Hi check this video. th-cam.com/video/_3_eiHX3RKE/w-d-xo.html

  • @justinrey08
    @justinrey08 10 หลายเดือนก่อน

    Will this incur any cost?

  • @guptaashok121
    @guptaashok121 2 ปีที่แล้ว

    When we try to manipulate ads data from python script. Do we need to establish any authentication.. between Pool and adls.. if you have any video for that..

  • @shilpashilpa2099
    @shilpashilpa2099 3 ปีที่แล้ว +1

    Very helpful content.... I tried to experiment and I'm getting the error "CommandProgramNotFound"; did I miss something in my python script?

    • @AllAboutBI
      @AllAboutBI  3 ปีที่แล้ว +1

      Seems like the python file is not found. Pls check the path of the file you are trying to invoke

  • @azurelearner4055
    @azurelearner4055 ปีที่แล้ว

    Thanks for the video and can you run the same code in azure batch itself without the azure data factory

  • @ranadeepbasani8890
    @ranadeepbasani8890 3 ปีที่แล้ว +1

    Hi Madam, Great video. I am using ADF for Bulk and Incremental loads. In an instance i need to do a REST API call for bulk loads. per call i am able to retrieve only 20 records from REST API. where the max can be at 50 records which i am not able to apply in ADF. so for this i started to look into adding a custom python module to hit the API. Is there way we can do this ? please let me know madam

  • @mahendhirang1226
    @mahendhirang1226 2 ปีที่แล้ว

    Hi
    I need details, how I can pass one file as a parameter into our python scripts. I have my source file in storage account..what my script doing converting a parquet(source file) to Xml file..can u pls tell me..

  • @hajaaa100
    @hajaaa100 2 ปีที่แล้ว

    what should i do if my python code imports something like below,
    import pandas as pd
    import os
    from glob import glob
    How to bring that pandas refrence

  • @skselva403
    @skselva403 2 ปีที่แล้ว

    We have a Virtual machine, Pool level Virtual machine is different or the same, it is the same meaning how do I use it?

    • @AllAboutBI
      @AllAboutBI  2 ปีที่แล้ว +1

      Sorry, I don't understand your question selva. Can you pls explain a bit

    • @skselva403
      @skselva403 2 ปีที่แล้ว

      ​@@AllAboutBI Hi Mam thanks for the replay, Pool Irukura Virtual machine and Azure Virtual machine
      different illa same ma mam,
      Ethu same na Pool Azure Virtual machine na use panalama ?

  • @jayanthjayaraman2627
    @jayanthjayaraman2627 3 ปีที่แล้ว

    How do i import the python libraries like shareplum and pandas, i have a python script but i need to integrate it to a pipeline , so i can grab the files from sharepoint using python

  • @Maddychin08
    @Maddychin08 3 ปีที่แล้ว

    Great video !! Can we execute shell script as well ??

  • @clotoscar
    @clotoscar 3 ปีที่แล้ว

    Thanks a lot :) from chile

  • @NeumsFor9
    @NeumsFor9 2 ปีที่แล้ว

    Remember Microsoft's PROJECT REAL from 2005? In it they write a script component to work around the limitations of the Lookup Transform when it comes to implementing Inferred Members without having to keep making calls to the RDBMS for each value not in a lookup transform. They wanted the best of the full cache mode and partial cache mode.
    If you remember it, this would be a GREAT USE CASE for , upon encountering a new value that is not in the cache of either an SSIS lookup or ADF Lookup transform.....and routing the value to another branch.......call the RDBMS, insert the new value, bring back the surrogate key in the pipeline, add it to the cache as a sorted in-memory dictionary and continue to the next row in the pipeline. The next time this same value is encountered in the same pipeline run, it can then reference that dicitionary, pick up the surrogate key, and rejoin the main ETL branch without AGAIN having to make ANOTHER RDBMS call.
    Does this make sense? It would be an awesome demo.....if, indeed you could use the external call transform to either run an Azure Function or call a custom activity or REST API that did the same. Not sure we could achieve the same speed as the SSIS script component, but it would still be cool to see.

  • @dukefler
    @dukefler 2 หลายเดือนก่อน

    Hello Ma'am.... I'll be really grateful if u cud extend this topic further and display Pandas and ML libraries etc in action on Batch Pool... Was just doing an interview task and found it difficult. In the end I went ahead with Databricks Python activity.

  • @mmmm_yesverynice
    @mmmm_yesverynice 3 ปีที่แล้ว

    Why is this better than an automation account

    • @AllAboutBI
      @AllAboutBI  3 ปีที่แล้ว

      Sorry, what's an automation acct

  • @tiananmensquareentertainme7361
    @tiananmensquareentertainme7361 3 ปีที่แล้ว +1

    Thanks a lot madam. This helped a lot for my project. Only the last step the custom output part is not working. If there is some modification in those points do let me know. It's not a show stopper for me anyways

    • @AllAboutBI
      @AllAboutBI  3 ปีที่แล้ว

      Glad to know. Did u use any library? Like pandas in ur script.

    • @tiananmensquareentertainme7361
      @tiananmensquareentertainme7361 3 ปีที่แล้ว

      @@AllAboutBI Hello madam. It's working perfectly fine now. The write file path I had missed out. I thought that was irrelevant. Added those and it's working perfectly

  • @harnesh007
    @harnesh007 3 ปีที่แล้ว

    Thanks for video. Is it possible to handle CSV file with custom + python script ?

    • @abhishekmitra6315
      @abhishekmitra6315 2 ปีที่แล้ว

      I want to know this as well. Kindly let me know solution if found

  • @abhishekkumar-es1wl
    @abhishekkumar-es1wl 3 ปีที่แล้ว

    Your videos are the best material on Azure, Thanks a lot...I have a question, can we restart the ADF pipeline from its point of failure? kind of checkpointing...

    • @AllAboutBI
      @AllAboutBI  3 ปีที่แล้ว

      Thanks for the feedback.
      Yes it's possible using powershe'll. Pl check microsoft documentation on it

    • @abhishekkumar-es1wl
      @abhishekkumar-es1wl 3 ปีที่แล้ว

      @@AllAboutBI Thanks for the swift reply mam...For example I have 500 files to copy from source to destination and after 400 files pipeline or Copy activity got failed, then is it possible that pipelines resumes for exactly for the remaining 100 files?

    • @AllAboutBI
      @AllAboutBI  3 ปีที่แล้ว

      Yes possible

    • @abhishekkumar-es1wl
      @abhishekkumar-es1wl 3 ปีที่แล้ว

      @@AllAboutBI Thanks again!!!

    • @abhishekkumar-es1wl
      @abhishekkumar-es1wl 3 ปีที่แล้ว

      Hi mam, I tried to search the same on net but could not get it. If you have idea then could you pls share any link or steps to achieve this?

  • @prakashchinni6126
    @prakashchinni6126 10 หลายเดือนก่อน

    pls upload english subtitles for this video . very good concept

  • @kathirsubramaniam4597
    @kathirsubramaniam4597 3 ปีที่แล้ว

    Thanks a lot. Could you please put a video on how to execute a PowerShell script from ADF?

  • @elanty1
    @elanty1 3 ปีที่แล้ว

    Hi mam, can I execute power shell code using same method

  • @nico_mano
    @nico_mano ปีที่แล้ว

    Thanks a lot

  • @aadhilimam8253
    @aadhilimam8253 7 หลายเดือนก่อน

    can we try this on free tier account ?

    • @AllAboutBI
      @AllAboutBI  7 หลายเดือนก่อน

      Yes boss

    • @aadhilimam8253
      @aadhilimam8253 7 หลายเดือนก่อน

      @@AllAboutBI thank you

  • @santhoshns2932
    @santhoshns2932 3 ปีที่แล้ว

    Thanks a lot for the Vedio.Could you please share how to import numpy and pandas through adf.Can you please make a Vedio and upload it for us.

  • @swaminathanteamsak7420
    @swaminathanteamsak7420 4 ปีที่แล้ว

    Thanks a lot. Iam trying to do some numpy operations. Can you pls make a video to download numpy and run some numpy codd

  • @techhunter877
    @techhunter877 4 ปีที่แล้ว

    Great content. But I get some error. Can u help

    • @AllAboutBI
      @AllAboutBI  4 ปีที่แล้ว

      Sure. Mail me the details to funlearn0007@gmail.com

  • @oriono9077
    @oriono9077 4 ปีที่แล้ว

    Great

    • @AllAboutBI
      @AllAboutBI  4 ปีที่แล้ว

      Thanks 🙏

    • @chharsha4932
      @chharsha4932 3 ปีที่แล้ว

      @@AllAboutBI Madam can you please make a video for doing SQL to SQl copy activity using python

  • @technologyexcitesme8829
    @technologyexcitesme8829 4 ปีที่แล้ว

    Please show output . That will help us

  • @guruprasadramanaboina8592
    @guruprasadramanaboina8592 2 ปีที่แล้ว

    Hi mam