My 2 cents: Product similarity is a complex problem to solve even for established top 4 companies. Image matching for product requires data science solution but that's just one small part. String matching could be managed better with data science solution, but if all you have is SQL, you can still create a good baseline using string functions like regex fns for matching. Your product matching confidence increases when you introduce more parameter to match. Dimensions like weight, size, height as pointed by someone in comments are good examples, Reviews are not simply due to sheer noise and overhead involved in parsing long winded strings. Think of what parameters will you look at when you buy a smartphone, a good laptop, a nice pair of earphones and retain only those parameters which makes sense to match two products.
@dhaneshshirodkar3654 жыл бұрын
This is more like a Data Science interview
@upskill92573 жыл бұрын
I've been looking for realistic data science/analytics case study discussions for some time. This has been by far the most relatable one for me. Please add more content like this.
@manoharlakshmana61714 жыл бұрын
This is what we need !!! Case study mock interview !!! Thank you so much !!! Hope you will upload more such videos.
@iqjayfeng4 жыл бұрын
Thanks for watching! If you want to prepare for Business Intelligence roles, then Interview Query has you covered! We have diverse questions to prepare for your Business Intelligence interview and stay ahead of the curve. www.interviewquery.com/p/business-intelligence-interview-questions
@nitishnayak68344 жыл бұрын
Can we use dimensions and weight of the mobile. Definitely similar products will have similar description and design
@jaganshemthegay54904 жыл бұрын
Consider sone master data attributes and combination of these attribute values gives the uniqueness to derive .. color , size , iOS version , first available date etc and some other features and do deduplication and map all product names to one unique artificial key
@mallikatiwari2524 жыл бұрын
Super Helpful ! Can you please do similar video on Amazon Business Analyst?
@manoharlakshmana61714 жыл бұрын
Please Have a video on : Suppose an ecommerce website is planning for a flash sale. How would you select which products to put on the sale.you can add more questions to it and then followed by A/B testing what kind of statistical tests would be used and where.
@jaganshemthegay54904 жыл бұрын
De duplication is great master data quality exercise that needs domain experts inputs and study of the features and attributes.. Once de duplicated transactions can be tied down to unique product is which gives better analytics
@DED_Search4 жыл бұрын
This is a very classic interview question for e commerce business. 👍 But this mock interview only scratched the surface of it. I have been asked the same question. I was grilled on similarity measures, cnn for image processing, rnn for nlp, etc.
@upskill92573 жыл бұрын
Can you share some additional parameters that were not covered here?
@tigerbear30382 жыл бұрын
SKUs often differ from merchant to merchant. You have to look at the model number or UPC
@mr.garammasala13983 жыл бұрын
Fuzzy look up/ fuzzy grouping works here.
@xiaoyangwu92704 жыл бұрын
Thanks for all the advices on implementing ML/DS algorithms into data engineer/Data warehouse part of the task, I am always wondering how those two can be combined when comes to solving problems togather
@chitranjantrivedi60083 жыл бұрын
Bro - Do we need to know Data Structures and Algorithms to crack Business Intelligence Engineer interviews at Amazon?
@tonestrada2000 Жыл бұрын
Do you have sample data for this? This seems a good case study for beginners like me
@karthikjosyula80024 жыл бұрын
wouldn't equivalent classes solve the problem of de-duping the classes Apple iPhone 10 = iPhone X... We can store these classes in under same ID and use the ID. Even tracking seller details where same seller has same product, labelled differently, posted multiple times.
@saumya45823 жыл бұрын
Thanks for this and many other videos Jay! I was looking for ways to get to know about the interview questions asked to work upon and stumbled upon your channel. Super helpful as I had been rejected at my dream company's interview and was starting again to work on my gaps to be able to apply again for the interviews.
@SoraAICreator4 жыл бұрын
This is really helpful. Thanks a lot. Can you do a mock Product Analyst Interview as well?
@fionaw99424 жыл бұрын
Does BIE need to have ML knowledge?
@iqjayfeng4 жыл бұрын
Depends on the role but usually no
@mitchieyy2 жыл бұрын
if the product have the same unique id such as SKU UPC they definitely get under the same listing when you list them. as an amazon seller i know that sellers get the product from the same manufacturer and they use their own brand name. that’s why you see the same product over and over again on amazon but they will have their distinct brand name, like kirkland and huggies get their diapers from the same manufacturer. with the big companies they manufacture the product slightly differently and package them differently. but with small amazon seller they don’t do all that and their product photo looks the same. In this case i don’t think they could be identified as the same because they are different brand. So to identify true duplicate that have similar image most importantly you need to check their brand are the same and then their key product description such as number of unit weight color size version are the same should they be considered possible duplicates.
@allison-hd1fg2 жыл бұрын
I think this case study is more for data scientist role. you sure this is a case for BIE?
@iqjayfeng2 жыл бұрын
Definitely for BIE roles at Amazon specifically
@pushkarajpalnitkar16954 жыл бұрын
Hi, Jay and Shashank, thanks for the video. I have one doubt though. Did he have to give data structures and algorithms' interview for the job?
@iqjayfeng4 жыл бұрын
Nope I don’t believe so
@yashsolanki0694 жыл бұрын
You are doing really great sir👍
@asthasrivastava95643 жыл бұрын
Quick question - Has anyone faced a question like this for the BIE role (may or may not be Amazon)?
@saniyajaswani44622 жыл бұрын
His solution is not scalable If we make such algorithm, how do we scale for 10000 products. Amazon does least manual intervention on duplication product detection
@fredbradshaw43082 жыл бұрын
I was thinking simple Regex 😂
@kushagrasen18484 жыл бұрын
we can also take the product dimensions
@porwalabhishek54 жыл бұрын
This is so helpful
@EmWatsonable4 жыл бұрын
Thanks for sharing! This is super helpful :)
@annachan67063 жыл бұрын
why not where product name LIKE '%iphone%'
@extcresources5313 жыл бұрын
Cos that would give you same brand but phones of different models.. iphone 6 iphone 7 iphone 8 etc..
@rish-l3z4 жыл бұрын
Thank you for the Video and the contents. Really appreciated.
@kalpeshsajnani99523 жыл бұрын
Thanks a lot for such a informative video
@alexz70324 жыл бұрын
Awesome stuff! thanks
@kaiyan95893 ай бұрын
I will probably just get lost by asking why we even want to de-dup the products on Amazon. To me, the question doesn't make sense. Having multiple sellers selling the same product (duplicated product) sounds like a good sign which creates healthy competitions.
@julialjo79213 жыл бұрын
“Nobody knew how to solve these tasks” and then Rajesh came all the way from India to save them.
@FPrimeHD16183 жыл бұрын
This helps out a lot, much appreciated. I have an interview for a Sr BIE position next week. First time interviewing with Amazon and I'm freaking out lol.
@extcresources5313 жыл бұрын
What was the result of your interviews? What all were asked? If you could share it will help a soul. Thanks mate
@FPrimeHD16183 жыл бұрын
@@extcresources531 I actually was offered a really good role as a data scientist in my current company that I couldn't pass up, so I stopped all interviews.
@toekneema3 жыл бұрын
what is a DFIDF? 7:40
@nanfengbb3 жыл бұрын
Should be TFIDF, a technique commonly used for NLP.
@nhrakib15723 жыл бұрын
hi dear how are you
@vincentpun1233 жыл бұрын
nice
@user-fl9ti9ej8g3 ай бұрын
This guy didn’t do so well lol
@BasitBhat4 жыл бұрын
Stop moving that chair. Just a quick feedback.👍🏼
@romeomonteque2 жыл бұрын
Um. Uh. Um. Uummmm. Um.. um. Before every sentence. Ummmm
@vallr-humans91213 жыл бұрын
Can't even understand what he is speaking. Sitting ungroomed. 👎