Extract and Load from External API to Lakehouse using Data Pipelines (Microsoft Fabric)

  Рет қаралды 14,217

Learn Microsoft Fabric with Will

Learn Microsoft Fabric with Will

Күн бұрын

Пікірлер: 49
@chetan2309
@chetan2309 Жыл бұрын
Hey! Massive thanks! Do you’ve plans to cover any oauth based API on your system! Also how to parallelise these APIs for massive data loads! Let say you want to fetch data for 100 cities on everyday basis. Also triggers when 101st is added all those scenarios
@LearnMicrosoftFabric
@LearnMicrosoftFabric Жыл бұрын
Hi, Greats questions! Absolutely yes, I plan to do more videos about handling different auth scenarios, and also loading v big datasets with parallel reads. Watch this space :)
@rdeheld
@rdeheld 3 ай бұрын
Thats not complicated. Would like to see it it the other way around
@jampeauk
@jampeauk Жыл бұрын
Just want to say a massive thank you for your Fabric videos they have been amazing. Keep up the great work.
@LearnMicrosoftFabric
@LearnMicrosoftFabric Жыл бұрын
Hi, thanks for watching! don’t worry, there’s plenty more videos to come!
@jampeauk
@jampeauk Жыл бұрын
@@LearnMicrosoftFabric I may have missed this in your videos but do you have a section on how to show the contents of a file directly and load the most recent file (my files all have date stamps in them). I have not had any luck with os.listdir().
@LearnMicrosoftFabric
@LearnMicrosoftFabric Жыл бұрын
@@jampeauk Hi James, for file system searching you probably want to use mssparkutils which has that kind of list files in a directory functionality - I plan to cover this in my upcoming video on mssparkutils 👍
@jampeauk
@jampeauk Жыл бұрын
@@LearnMicrosoftFabric awesome thanks Will, looking forward to this. To provide a little extra context I would like to list the files located in my S3 Bucket which I have added as a Shortcut.
@alex24tech
@alex24tech 6 ай бұрын
how to run a pipeline for data copying. In fact, I have an API that uses two authentication systems: token and basic authentication (user and password). the first connection to the API (via the post method) allows you to retrieve the token which will be used afterward by the second request to execute the request itself. Is it possible to create a paper that can do the job? should I use nodebooks or is there a solution? the result of the second query will of course be stored in a lakehouse table.
@LearnMicrosoftFabric
@LearnMicrosoftFabric 6 ай бұрын
Yes, should be possible either in Data Pipeline, or Notebook. You can make the post request, then pass the token to your next activity.
@alex24tech
@alex24tech 6 ай бұрын
@@LearnMicrosoftFabric Thanks sir. Please do you have any ressource that can help me?
@dineshreddy2207
@dineshreddy2207 4 ай бұрын
Hi, I have an XML file an want to ingest this file into MS Fabric without using notebook, Can you help me ?
@LearnMicrosoftFabric
@LearnMicrosoftFabric 4 ай бұрын
Should be able to use either Dataflow or Data Pipeline, but if it’s horribly nested XML, notebook will probably be necessary
@itversityitversity7690
@itversityitversity7690 4 ай бұрын
I used copy activity but seems some problem and suggestions please give other way..
@fnplazatuc
@fnplazatuc 5 ай бұрын
Hi, how are u? After data extraction, How its the next step to transform the data and visualize this in MS PowerBi?
@LearnMicrosoftFabric
@LearnMicrosoftFabric 5 ай бұрын
Hi there, good thanks, you? In this video here I go right from end-to-end talking about extraction , storage and then visualization. Hope it helps 👍kzbin.info/www/bejne/nqjahmuMaZ2aY5Y
@fnplazatuc
@fnplazatuc 5 ай бұрын
@@LearnMicrosoftFabric Will how are u? Your video are util! I have a question.. It's possible obtain data from JSON API rest and will transformate to table in a datalake? I can't execute this.. only transform in a Warehouse! Thanks!
@sreekanth0112
@sreekanth0112 4 ай бұрын
Hi, Please make the video on extracting the files from share point to lakehouse through Data pipeline ( Data Factory) in fabric
@gguuyypp
@gguuyypp 3 ай бұрын
Thanks, can you make a video about extracting a file from SFTP ?
@KurtJ-r8w
@KurtJ-r8w Ай бұрын
Really hope you do more Fabric content. You were clear, structured and concise in the teachings done
@DinoAMAntunes
@DinoAMAntunes 7 ай бұрын
Hello Very good Tks very much. My ERP is 100% online but i can´t connect to it. I think i have all the data necessary. URL, db Name, Username Password or API.
@LearnMicrosoftFabric
@LearnMicrosoftFabric 7 ай бұрын
Hey if it's 100% online and an ERP system, it's likely to have an API to connect to. Google " {ERP NAME} API documentation" and find out how to connect to it. Or if it's one of the big ERP systems, you could use a dataflow because they might have a pre-built connector for your ERP system available. Good luck
@anushav3342
@anushav3342 10 ай бұрын
Great content. Thanks for explaining about different options available in Fabric. I need to load a Fact data which is a bookings data through REST API call. How to setup the loading into lakehouse for ingesting weekly updates. Do i need to start with pipeline or is there a way to start with notebook directly to load data into the lakehouse.
@LearnMicrosoftFabric
@LearnMicrosoftFabric 10 ай бұрын
thanks for watching! it depends on the complexity of your api call really! if it’s simple, then you can use dataflows or data pipelines, more complex authentication or transformation will require a notebook
@hotrung5469
@hotrung5469 10 ай бұрын
Thank you so much Will for your detailed instructions!!! Could you help me make an instruction to load Excel files in OneLake (specifically stored in lakehouse) into Tables in Datawahouse?
@LearnMicrosoftFabric
@LearnMicrosoftFabric 10 ай бұрын
hey thanks for watching! to read excel into a lakehouse table, you can either use pandas to load into a pandas df and convert to spark df (and then lakehouse table) or you can use the pyspark.pandas library (pandas within spark) - good luck!
@chescov
@chescov Жыл бұрын
Much appreciated my good sir 👏👏
@LearnMicrosoftFabric
@LearnMicrosoftFabric Жыл бұрын
No problem, thanks for watching ☺️
@stevengarcia7277
@stevengarcia7277 3 ай бұрын
thanks mate, well explained.
@samirsahin5653
@samirsahin5653 Жыл бұрын
I came here for same question. That some people already asked. How to call this api for multiple cities. I watched your other videos that you used notebook to transform data and in other video scheduled in pipeline. If you can show how to call this api for multiple cities, would be a great project. You can create a playlist as a end to end project. I really like your channel, following your daily spark videos. I believe this channel will be one of the main source of fabric youtube channels.
@samirsahin5653
@samirsahin5653 Жыл бұрын
Just saw you already have a playlist:)
@LearnMicrosoftFabric
@LearnMicrosoftFabric Жыл бұрын
Hey! Yes, I plan on continuing this series and going a bit deeper on data pipelines v soon! Thanks for watching and for your kind words 💪🙏
@matask23
@matask23 7 ай бұрын
Amazing video, thanks for this Will! I wanted to ask if PySpark would be the most optimal choice to achieve this or if I could use SQL to achieve the same goal?
@LearnMicrosoftFabric
@LearnMicrosoftFabric 7 ай бұрын
Yes you could also use SQL! The good thing about fabric is that you're free to use whichever language you are comfortable with! (well as long as it's T-SQL, Python, R, Scala or KQL)
@matask23
@matask23 7 ай бұрын
@@LearnMicrosoftFabric Thanks for that, that's really useful to know! I guess my follow up would be whether there's any compatibility issues or limitations that I might encounter if I was to use SQL within MS Fabric?
@mshparber
@mshparber Жыл бұрын
Thanks. Please explain what is best practice to make a nested api calls and merge the results back into one json file? For example, the first api call /students - gives me a list of all students, then for each I need to make another call /{sudent_id}/courses to get their courses information. I need to save the results of all students’ courses as one json file. It’s easy to do in Dataflow, but it cannot save the results as json, only table. So what is the right way to do it in Pipeline?Thanks!
@LearnMicrosoftFabric
@LearnMicrosoftFabric Жыл бұрын
Hey it's not something I've done with Data Pipelines tbh, but might be possible with the For loop activity? If you know how to use Python, I would recommend doing this in Fabric Notebooks with the requests library - much easier to manage this kind of logic in a notebook.
@mshparber
@mshparber Жыл бұрын
Thaks. One of the main advantages in Power BI tools is low-code/ no-code. I know Python, but I we need a simple GUI low-code experience. Like a Power Query / Dataflow. I hope Pipeline can provide it @@LearnMicrosoftFabric
@jampeauk
@jampeauk Жыл бұрын
@@mshparber if it helps there is now a GUI which should do what you are after, do some watching/reading on "Data Wrangler" it is currently only avaliable for Pandas in Notebooks but it should be useful.
@rashane1000
@rashane1000 10 ай бұрын
Awesome video, keep it coming! How about having Oauth2 protocol? New subscriber here, thanks very much!
@LearnMicrosoftFabric
@LearnMicrosoftFabric 10 ай бұрын
Hey thanks for watching! Currently I haven't covered this yet, but I should make something about oauth2 yes because it's such a common use case.
@rashane1000
@rashane1000 10 ай бұрын
@@LearnMicrosoftFabric thanks heaps.looking forward for your next vids 🔥🔥🔥
@FranciscoRodriguezFabric
@FranciscoRodriguezFabric 8 ай бұрын
Thanks !
@LearnMicrosoftFabric
@LearnMicrosoftFabric 8 ай бұрын
No problem, thanks for watching!
@KAshIf0o7
@KAshIf0o7 Жыл бұрын
waiting for next part
@WillOSullivan-k1q
@WillOSullivan-k1q 10 ай бұрын
Good explanations mate keep up the good work
@LearnMicrosoftFabric
@LearnMicrosoftFabric 10 ай бұрын
Cheers Will!
@peternguynguyen5208
@peternguynguyen5208 10 ай бұрын
Nice instructions, thank you
@LearnMicrosoftFabric
@LearnMicrosoftFabric 10 ай бұрын
thanks for watching!
Scheduling Notebooks in Microsoft Fabric + Reading JSON from Dynamic File Paths
6:48
Learn Microsoft Fabric with Will
Рет қаралды 4 М.
Calling JavaScript - Intro to Web Dev Fall 2024
22:00
Kevin Workman
Рет қаралды 32
إخفاء الطعام سرًا تحت الطاولة للتناول لاحقًا 😏🍽️
00:28
حرف إبداعية للمنزل في 5 دقائق
Рет қаралды 36 МЛН
Крутой фокус + секрет! #shorts
00:10
Роман Magic
Рет қаралды 24 МЛН
ДЕНЬ УЧИТЕЛЯ В ШКОЛЕ
01:00
SIDELNIKOVVV
Рет қаралды 2,7 МЛН
Capacities, Workspaces and Access Control in Microsoft Fabric
15:20
Learn Microsoft Fabric with Will
Рет қаралды 13 М.
Using Fabric notebooks (pySpark) to clean and transform real-world JSON data
17:42
Learn Microsoft Fabric with Will
Рет қаралды 6 М.
Microsoft Fabric Lakehouse Tutorial
25:57
Learn Microsoft Fabric with Will
Рет қаралды 6 М.
Lakehouse data validation with Great Expectations in Microsoft Fabric
36:18
Learn Microsoft Fabric with Will
Рет қаралды 4,7 М.
Organize a Fabric Lakehouse using Medallion Architecture Design
36:06
Kamil Data Geek - Azure explained
Рет қаралды 4,1 М.
Microsoft Fabric: How to Ingest API Data Dynamically in Microsoft Fabric
11:38
Data pipeline vs Dataflow vs Shortcut vs Notebook in Microsoft Fabric
31:16
Learn Microsoft Fabric with Will
Рет қаралды 19 М.
Microsoft Fabric - Incremental ETL
26:29
Azure Synapse Analytics
Рет қаралды 16 М.
Microsoft Fabric: Lakehouse vs Warehouse
30:59
James Serra
Рет қаралды 15 М.
إخفاء الطعام سرًا تحت الطاولة للتناول لاحقًا 😏🍽️
00:28
حرف إبداعية للمنزل في 5 دقائق
Рет қаралды 36 МЛН