Google Coding Interview Question - firstDuplicate

  Рет қаралды 240,778

Nick White

Nick White

Күн бұрын

Пікірлер: 502
@do_chill_regular
@do_chill_regular 5 жыл бұрын
The third solution is brilliant, doubt I’d ever come up with that under interview type pressure.
@dassbah5316
@dassbah5316 4 жыл бұрын
The 3rd solution does not solve the space problem. It relies on getting a deep copy of the array that you can change as you want in your function. Which means that it always requires N * size_of(int) memory. The 2nd solution can handle a shallow copy / const reference to the array and requires this amount of memory only in the worst case (no duplicate). One could improve the 3rd solution by creating an array of booleans/bits to indicate that a number was already visited, while using a const reference to the original array. This would result in N bits of additional memory instead of N * size_of(int).
@David-fr7ee
@David-fr7ee 4 жыл бұрын
Dass Bah do not understand but i think u right
@hatrick3117
@hatrick3117 4 жыл бұрын
me brain hurts
@saranshkumar3664
@saranshkumar3664 4 жыл бұрын
Damn it took me around 5min to understand what he was doing!
@TanninZero
@TanninZero 4 жыл бұрын
@@dassbah5316 But it depends on the caller whether you get a copy or not (and if it's not a copy, whether it's ok to change it in the first place!). In the second solution the worst case is actually that the caller created a copy _and_ you create your set/bitmask. Tbh. if I was the interviewer I wouldn't primarily be looking for a great solution to the task, I would be looking for the interviewee to ask good questions. Like "Are we optimizing for cpu cycles or memory usage?" Or "do we expect large arrays or a lot of calls to this function with small arrays?" Because that third solution is dereferencing the array and calling Math.abs a whole lot, the first solution, despite being O(n^2), will actually run faster up to a certain array length because each individual iteration does less work. O-notation gives an indication of how the algorithm scales, not generally how fast it is. Both you and the talking head in the video make a whole bunch of assumptions that I, as an interviewer, wouldn't be that happy about. Over-confidence is a way too common characteristic in programmers which I learned to watch out for.
@bundayyolayinka3352
@bundayyolayinka3352 5 жыл бұрын
I implemented it as the second way straight up. After watching the video to the end, the Third way is really great
@younessalmy1997
@younessalmy1997 4 жыл бұрын
@Thomas Kwok i just started, I implemented it as the second way straight up how do you know the time complexity ?
@divjyotsinghkhanuja1546
@divjyotsinghkhanuja1546 4 жыл бұрын
​@@younessalmy1997 Generally implemented sorting is Quick sort for which the TC is O(nlogn)
@FrancoMunizAR
@FrancoMunizAR 4 жыл бұрын
Third solution is amazing. You're using the logic used in a dictionary/hash table solution but saving space by using the same array. Wish I could come up with this kind of solution by myself haha.
@kunal_chand
@kunal_chand 5 жыл бұрын
Your video on "How to Crack Google Interview (Complete Guide)" is now private. Can you please post it again. It would be really helpful.
@oblivion_2852
@oblivion_2852 5 жыл бұрын
5:24 Just looking at the problem I'd use a hashset and add elements to the set. The hashset only has to be as large as the array length so memory complexity of n and the time complexity would be worst case n as you iterate over the list and check if a value is already within the set if so return that value.
@umeshnaik844
@umeshnaik844 4 жыл бұрын
thats true but in java hashset add method give true and false based on if element added in set or not
@sfcs3743
@sfcs3743 4 жыл бұрын
Hashset is an unnecesary data structure when a simple array with a tally could accomplish the same task
@BrieoRobino
@BrieoRobino 4 жыл бұрын
@@sfcs3743 that's literally the purpose of the hashset.
@sfcs3743
@sfcs3743 4 жыл бұрын
@@BrieoRobino Hashsets don't come for free when considering the overhead and O(1+load-factor) complexity. Worse yet, if the implementation doesn't use clever implementations like universal hashing (Java doesn't iirc), the complexity could be theoretically higher. The wording of the question has restricted the cardinality of the universe of keys to be exactly the same as N. Thus, you can use direct addressing with O(N+N) complexity with an array structure to simulate a hashmap/hashset. Model the index as the keys (input) and the content as a tally (ie +1 when seen). With this method you save some overhead that is associated with a typical implementation of a hashset.
@BrieoRobino
@BrieoRobino 4 жыл бұрын
@@sfcs3743 not everything is a complexity test. This isn't the 1970s where we have to deal with memory limits imposed by the technology. Any potential overhead from using a hash set is negligible at best. If I were doing the code review of someone that had a solution to this problem that was as over-engineered as his third solution I would suggest a hashset. sometimes readability and maintainability is far more important than saving a tiny amount of time and bits.
@sfcs3743
@sfcs3743 4 жыл бұрын
I kinda thought about the last solution in my head. There's an important drawback to this solution that ultimately makes it *O(n)* space complexity. This is because we need a bit to store that the element has been previously visited. In your case, well two's complement complicates things, but for simplicity's sake assume it has a bit for the sign. Originally, I also did this for unsigned integers, and as a way to store the state: I simply added the size of the array to the array and made subtraction/modulo/division to evaluate the information. Both these methods require the maximum number of n (array size) to be less than half of the maximum value the container could hold. If it is a 32 bit int, then it has to be signed or (unsigned/2). 2^31 (signed way) = (2^32)/2 (unsigned way) Thus, there is a stipulation that n cannot exceed the half-value of the element's container making it O(1) for ∀n < (N/2) where N is max number of states where each container's element could hold. Conversely, if we redefined this N to be the size of the array; it would lead to assigning an extra bit for each element. The cost of this space just by itself is logarithmic but the key takeaway is that it has the potential to double the N. The calculation when considering this potential is: total space: *O(n)* = O(N) + O(N) Note: I did not add the logarithmic O (log32(n) for 32-bit container) because it would make this recursive. Δspace (removing original array): *O(n)* = O(N) + O(1) since N (max size of element container) = n (array size) *O(n)* = O(n)
@dipendrachandrajha7341
@dipendrachandrajha7341 5 жыл бұрын
Nick you explained this so clearly step by step asking us to notice the minute details that the elements in array are between 1 to arr.length , then making the use of that point . Great 🤘
@dave6012
@dave6012 3 жыл бұрын
Why is the line height of your comment so much shorter than the rest of the comments?
@cerealnashta
@cerealnashta 3 жыл бұрын
When you first explained the question I immediately knew that it had something to do with the indices, but the only way I could think was to sort the array and compare the indices with the value stored on that indice. But that would give a wrong result if there were multiple duplicates. Your solution was amazing!
@divyanshusingh6473
@divyanshusingh6473 4 жыл бұрын
Mind blown, feeling liberated, some inception level shit. Hats off for clear-cut solution
@skylarkenneth3784
@skylarkenneth3784 4 жыл бұрын
Went form an external (hash) to an internal (1 to N values) mapping. Nice.
@TheNoMoreGamer
@TheNoMoreGamer 4 жыл бұрын
I went through a year or 2 of being taught this in college and never once understood, and it took not even 5 minutes to explain it in a way this easy to understand.
@iamnoob7593
@iamnoob7593 5 жыл бұрын
i really liked this way of explaining through drawing.Continue this style for leetcode problems as well.
@NadyaPena-01
@NadyaPena-01 4 жыл бұрын
I just want to say that I love your videos. I'm preparing for coding interviews now and this is so helpful. I'm going to buy your merch to support. =)
@JoeBonez
@JoeBonez 4 жыл бұрын
The third solution mutates the array, though. Maybe it doesn’t matter, but maybe it does.
@otchigal6527
@otchigal6527 4 жыл бұрын
Joe Bonez true. In essence it’s adding an N bool array
@admkbldwn
@admkbldwn 4 жыл бұрын
We can actually solve this by using a BitSet to keep track of whether an index is flagged or not. It's like an array of bools, except each bool is only a single bit in a small array of bytes. The added memory footprint is relatively small and the original array gets to go untouched!
@svenjaaunes2507
@svenjaaunes2507 4 жыл бұрын
@@otchigal6527 but that means you use linear memory but the condition was constant memory. that trick method is stupid.
@PrzystankersFCB
@PrzystankersFCB 4 жыл бұрын
If the problem was not about first duplicate but any duplicate, then I think there's a way of doing it in O(n) time and O(1) space without destroying the input. Not sure if this is possible for finding the first duplicate though...
@gilbes1139
@gilbes1139 4 жыл бұрын
Given the requirements, it seems like a nasty side effect. A step should be added to reset the values to positive. This method is also not thread safe. Is that a requirement? Who knows. This is why I hate interview questions.
@showmickkar7793
@showmickkar7793 4 жыл бұрын
I was able to come up with the first and the second solution by myself but the third one simply blew my mind! Thank you so much! :D
@Sehyo
@Sehyo 4 жыл бұрын
You can solve it in linear time without extra space and without mutating the array with Floyd's tortoise and hare algorithm.
@sfontesv
@sfontesv 3 жыл бұрын
Floyd's wouldn't work for input [2, 1, 3, 3, 5, 2]
@Sehyo
@Sehyo 3 жыл бұрын
@@sfontesv Yeah Itd find the 2nd duplicate, my bad. Was thinking any duplicate.
@mohammedjawahri5726
@mohammedjawahri5726 4 жыл бұрын
A small comment on 9:25 Being O(N^2) doesnt mean that it takes you N^2 time to do it. It means that your growth rate is quadratic. The time taken could literally be anything but the point here is, if you double the input size, the time taken (whatever it was, it could be a 0.000001 ms) will not double it will 2^2 = 4 aka quadruple Quadruple the input size and the time taken will be the time it took for the original times 16
@micromir4
@micromir4 5 жыл бұрын
Man keep it up. This is the single most useful content for tech professionals!
@ankushgats
@ankushgats 5 жыл бұрын
Before proceeding with third solution its worth asking whether or not you are allowed to change input array.
@Epistemer
@Epistemer 5 жыл бұрын
I think that in the first solution it would've been more optimal if the second pointer would loops only until the min_second_index, so i+1 < j < min_second_index , then if (a[i] == a[j] ) { min_second_index = j }
@Daniel_WR_Hart
@Daniel_WR_Hart 3 жыл бұрын
You're right, good observation
@TJHooper123
@TJHooper123 4 жыл бұрын
I've seen a few responses saying to just sort the array. Here's why you can't. The problem requires you to find the 'first duplicate' in the array. If you have arr = [3, 3, 1, 1, 1, 1]. you can obviously tell that the first duplicate is 3. If you sort the array [1, 1, 1, 1, 3, 3], the first duplicate is now 1. The problem doesn't want to know if there's ANY duplicate. It specifically wants to know which set of numbers in the original array produces a duplicate first. Sorting the numbers completely disregards that requirement.
@DerLetsPlayer0815
@DerLetsPlayer0815 4 жыл бұрын
You will not get around O(n) space complexity in either the second or third case, making the algorithm and analysis in the third case incorrect. If you program this in a strongly-typed language then the integers will be either signed or unsigned, so it could happen that you won't actually have the space to make a number negative (e.g. if you have unsigned 8 bit ints and an array that fits >127 integers, which is a valid input here). In that case you need to expand the data type into a bigger size (in this case 16 bit) and that would mean tetta(n) space complexity, if you just expand every integer. If you use a non-strongly type programming language it will probably expand it internally, leading to O(n) space complexity and if you don't consider this the algorithm (or your analysis) is incorrect. In summary, you still need space to save the sign, and that would lead to incorrect results if the space is just not there (see given example). The reason they give you an array with entries smaller/equal than the array size is for the exact reason that you don't have to hash the numbers to begin with, a HashSet is overkill. They probably just want you to create an array with size n where you use the number as index to check whether you encountered the number first, which again is tetta(n) space complexity.
@sfcs3743
@sfcs3743 4 жыл бұрын
I just made a comment of mine on this. Should have gone through the comments section before typing it.
@44r0n-9
@44r0n-9 3 жыл бұрын
The simplest and cleanest solution with regards to the constraints (if we're unsure whether the original array is allowed to have values that violate the initial constraints) is either using an n-length array of booleans with k-1 as index or a bitset of length n to save even more space.
@JacquesChaumont
@JacquesChaumont 4 жыл бұрын
I was also thinking about why we hadn't used the restriction of 1
@fghsgh
@fghsgh 3 жыл бұрын
The thumbnail gave the third solution away! Also, I assumed the array would be unsigned because negative numbers don't make sense in it, and modifying your inputs is generally a bad practice. Plus, because of CPU architecture details (branch prediction and cache size) I suspect a solution using a bitset would actually be faster.
@lost-mar-ble
@lost-mar-ble 3 жыл бұрын
for anyone unable to get the third answer...nick has another video where a similar question is solved with this exact apporach. Check for return duplicates in integer array. There we return all the duplicates and here we return the first one to be found. The third approach is really beautiful.
@F1mus
@F1mus 4 жыл бұрын
I would argue that making the numbers negative is actually using O(n) space. There's a much more intuitive solution that doesn't require modifying the elements -- only swapping them. Go through the array and put each item "in place". Unless it's already in place, if you try to put it in place and there's already the right element there, that's the answer.
@voltorian-minecraft
@voltorian-minecraft 3 жыл бұрын
i came up with something similar to the second. with my current level of knowledge there's no way i would have thought of the third in any reasonable amount of time, and even after figuring out how it works i feel like i'd still have trouble actually implementing it. i looked at the solutions and even after writing the process on paper i didn't understand how/why it worked until i saw this video, so thanks. i gotta get on the grind to get familiar with the patterns and nuances...
@ehsanganjidoost7825
@ehsanganjidoost7825 3 жыл бұрын
The last one is called Negative marking. This way, you integrated the Seen operation, with index. However, you do not need to deduct one. Your elements are [1, n] and you have n+1 elements. So, whenever you go through the list, just mark negative, the element at such index. So, - nums[nums[idx]]; later if you reach to a duplicate, since you're trying to make nums[nums[idx]] negative, and it's already negative, you will realize, that you met such number before.
@SandeepKumar-nk2ru
@SandeepKumar-nk2ru 3 жыл бұрын
Below solution is similar but little different from concept perspective and easier to understand: Since the input list cannot have negative values (because it is mentioned the elements have to be greater than 1), keep on making the elements -ve as you move through the loop and if you already find a -ve of current element, you return it. (Below sol in python) def first_duplicate(in_list): for i in range(len(in_list)): if -in_list[i] in in_list: return in_list[i] else: in_list[i] = -in_list[i] return -1
@anon746912
@anon746912 4 жыл бұрын
My idea is similar to the 3rd method - but using bitwise checks instead. Initialize an integer with binary length equal to the array length, and use each bit to represent the "seen" status of that position. E.g. 1000 = Only 1 of 4 has been seen; 0110 = 2 and 3 of 4 have been seen. When reading through the array, check the bit at that position. If 1, that's your first duplicate, else flip the bit to 1. Space required would be number of bits equal to length of list, not sure how that would compare to a hashset.
@Revitalish
@Revitalish 3 жыл бұрын
LOVE your latest videos where you start drawing and explaining so much better! You're the BEST !!! Thank you
@oblivion_2852
@oblivion_2852 5 жыл бұрын
I like the idea of using an intrinsic component of numbers like negatives to store boolean values to basically treat the data structure you already have as a hashset (without the modulo and collision detection)
@ak-od7mf
@ak-od7mf 10 ай бұрын
i wish that hed post more of these "coding interview question" videos and add them as a series and/or playlist. Really good videos.
@RedstoneFTW
@RedstoneFTW 4 жыл бұрын
I'm not really a programmer, but I was proud I came up with the exact second answer pretty quick!
@PoulJulle-wb9iu
@PoulJulle-wb9iu 4 жыл бұрын
you came up with a hashset nto being a coder? then you are in math, but whatever stupid
@danailnedkov9561
@danailnedkov9561 4 жыл бұрын
Bullshit
@falseee4445
@falseee4445 4 жыл бұрын
@@PoulJulle-wb9iu u dont need much experience to know hashsets. After just 1 or 2 weeks of learning programmation u can come up with this
@PoulJulle-wb9iu
@PoulJulle-wb9iu 4 жыл бұрын
@@falseee4445 no shit tard
@falseee4445
@falseee4445 4 жыл бұрын
@@PoulJulle-wb9iu ?
@milos5247
@milos5247 4 жыл бұрын
Yea basically, since the values of the array are between 1 to n, means that every value is a valid index if you subtract 1 from it, and if there are duplicates, they are always gonna reference the same index, it's just a matter of figuring out how to track which index was already referenced in constant space.
@magenoir999
@magenoir999 3 жыл бұрын
I almost found the last solution by myself. I didn't think of using the fact that they were integers and swaped the numbers to their indexes, but only swapping to indexes smaller than the current index, while checking each time if the number you're about to swap isn't the same as the current one, if it is return it. in retrospect it may be the best solution for the same problem with unsigned integers
@programaths
@programaths 3 жыл бұрын
The 2.5 solution is to use an array of boolean. If the boolean is already true at the "value-1" index, it's done! In some languages, it's expensive (java) unless you use BitSet (map you nth bit to the correct bit in bytes). In other languages, you can use pointer arithmetic and some bit twiddling! (Doable in Java too, but why stray away from existing solution ?)
@umeshnaik844
@umeshnaik844 4 жыл бұрын
we dont need to use contain method to check if element present in list ... add method itself give true and false for element added successfully and not respectively ..when we find first false for add method we can skip remaining element
@analogylibrary
@analogylibrary 5 жыл бұрын
The last approach of solving the problem in o(n) is similar of finding the smallest positive missing number in unsorted array.( The only difference is that after looping all element and marking number -ve, we have to return the very first index+1 where positive number is found, so here we find first positive number 5 which is at index 3 so 3+1= 4 is smallest positive missing number in unsorted array.)
@t_kon
@t_kon 5 жыл бұрын
It's same in terms of time, but it's not in terms of space. It's not using any extra space.
@theenilenation
@theenilenation 3 жыл бұрын
havent watch the video yet, but could be it possible to use a for loop to scan through the entire array and while scanning look for repeated values, then save the indexes of those values into a new array then compare index to see which index is lower and let that be the final answer. for example [2,1,3,5,3,2] run the loop, int [] indexOfRepeated is [4, 5] , if (4
@taocry
@taocry 4 жыл бұрын
That's only for mutable array. If the language doesn't allow changing input parameter, it might be better to create a new array for the marking and checking.
@Sim0000n
@Sim0000n 3 жыл бұрын
You could further improve the last solution by typing *=-1, instead looking ot up again
@fakhermokadem11
@fakhermokadem11 3 жыл бұрын
it;s like you're using the array itself as a boolean array of flags to each integer, having a duplicate or not. and that's okay because you can clean up by abs() through the array
@createprince2093
@createprince2093 5 жыл бұрын
really digging this style, very thorough but nicely presented
@somerandompersonintheinternet
@somerandompersonintheinternet 4 жыл бұрын
This third solution is not as good as you may think. Either the caller has to be okay with the function mutating the array (which is rarely the case) or you have to copy it (but then you lose the only benefit of this approach over just using a hashset). Basically, it feels hacky and not useful. I think a good compromise would be to use the same principle the solution is using, and use a BitSet instead. It would still have O(N) space complexity, but it would use orders of magnitude less space than the set (depending on sizeof(int))
@mikepschneider
@mikepschneider 3 жыл бұрын
Constant time is not the same as no time. It means the time to access an element does not increase as more items are added, but it will always take non-zero time.
@danielsanchez9879
@danielsanchez9879 5 жыл бұрын
For my solution I went through the array and tried to swap values into their proper place (index = value - 1) one by one, always holding onto the number we displaced and attempting to swap that number into its proper place. If that place already has the proper value, the swap is not completed and we know we have a duplicate. It does use a temporary variable for the swapping though, so that's 0(1) extra space. The data values aren't changed though which might be helpful, idk. You do need to make sure you're skipping over numbers that are already in their spot at the start.
@sfcs3743
@sfcs3743 4 жыл бұрын
Your solution probably breaks down for: [5, 1, 1, 6, 6, 6] where your algorithm would return 6 when it should be 1?
@RaymondLo84
@RaymondLo84 4 жыл бұрын
the last solution is easy if you have seen the solution. Basically, to make that short to understand is we are hacking the array to be the hash table itself. The index mapping is just a hash set itself.
@tobyjacobs1310
@tobyjacobs1310 3 жыл бұрын
I was thinking about this... Was trying to think of a way to use counting sort on a fresh array, but figured that the -ve trick might work given all numbers positive. Only question would be whether my method can mutate the array or not - if it should be read only we'll need O(n) space after all.
@TripPhillips
@TripPhillips 5 жыл бұрын
Isn’t the first solution n log n instead of n^2? The distance the second pointer needs to travel gets smaller with every movement of the first pointer. The second pointer isn’t traversing the entire list every time.
@hazaelbatista3241
@hazaelbatista3241 4 жыл бұрын
The pointer travels N - 1 indexes at first, then N - 2, then N - 3... So it traverses (N - 1)+(N - 2)+(N - 3)+...+1 indexes, which adds to (N^2 - N)/2, giving us O(N^2). Edit: I wrote multiplication of terms instead of sum of terms, thank you @Petar Jovovic for pointing it out
@muneebrana4022
@muneebrana4022 4 жыл бұрын
U normally see logs in algorithms that take a divide and conquer approach, such as merge sort. Since u are cutting your input in half with every iteration, the time complexity involves log. Its like the opposite of n^2 where u double the iterations required.
@petarjovovic308
@petarjovovic308 4 жыл бұрын
@@hazaelbatista3241 You mean (n-1) + (n-2)+.....1 right?
@hazaelbatista3241
@hazaelbatista3241 4 жыл бұрын
@@petarjovovic308 Wow, nice catch, I was kinda sleepy and made that huge mess :( Will edit, thanks!
@davidseek
@davidseek 4 жыл бұрын
Interesting 3rd approach, but in swift a function parameter cant be changed as its a constant. So I would have to create it as a new variable and therefore uses extra space. I could have a &Inout solution, but then it would change the reference element and I’m not sure that this would improve the 2nd solution. But it’s a good conversation one can bring up in an interview.
@richardgmiami
@richardgmiami 4 жыл бұрын
Gonna be honest, I thought the fact that the array couldn't contain a value larger than it's indexes was a red herring. I came up with the second solution (the first one seemed unnecessarily complex and I would never have thought of it) on the spot. But that third solution is really clever. The only possible downside is that you may need to go back over the array to convert the negatives back to positive (but given the scope of the problem presented that wouldn't matter at all).
@asbeeq9513
@asbeeq9513 4 жыл бұрын
I really like your approach of explaining with 3 different methods
@rentristandelacruz
@rentristandelacruz 3 жыл бұрын
Here's a number-theoretic algorithm for this problem (the algorithm is not very efficient and requires a pre-computed resource but I think it's an interesting one): [1] Let P(x) be a function that returns the x-th prime number. This function can be pre-computed. i.e. The dictionary with entries (x, P(x)) can be pre-computed. [2] Let Product = 1. Let 'Array' be the array of numbers. [3] Product = Product * P(Array[0]+1); // if Array[0] = x, multiply 'Product' with the P(x+1) which is the (x+1)-th prime number. [4] LOOP (for i from 1 to n-1): [4a] if (Product % P(Array[i]+1)) != 0: Product = Product * P(Array[i]+1); [4b] If (Product % P(Array[i]+1)) == 0: return Array[i]; // If Product is divisible by P(Array[i]+1), then Product already has P(Array[i]+1) as a factor which means Array[i] has been encounter before. [5] return -1;
@rigenmehilli5864
@rigenmehilli5864 4 жыл бұрын
Array slicing may come in handy for this problem. I agree the third solution is clever, but it’s not straightforward. Why not iterate through the list and for each number see if it exists in the subset of the list up to the previous element?
@aditparida1
@aditparida1 5 жыл бұрын
The last solution was amazing
@PoulJulle-wb9iu
@PoulJulle-wb9iu 4 жыл бұрын
and not made by him
@nvsabhishek7356
@nvsabhishek7356 4 жыл бұрын
Nobody is arguing that he came up with the solution. He is just helping us out to understand it. Noble cause.
@ralexand56
@ralexand56 4 жыл бұрын
@@PoulJulle-wb9iu why so bitter
@PoulJulle-wb9iu
@PoulJulle-wb9iu 4 жыл бұрын
@@ralexand56 lol
@Mobin92
@Mobin92 3 жыл бұрын
In a real project things like the 3rd solution is what adds terrible terrible bugs to the code base... Either it's to complicated that other devs won't understand it and slowly break it. Or it's straight up buggy, e.g. what if the caller of that function didn't expect that the array would get modified? Also it doesn't even necessarily save memory, since you couldn't use a data type that stores only positive integers in the array. So that bit was wasted there instead of in a proper bitset.
@desmondbrown5508
@desmondbrown5508 4 жыл бұрын
Third solution is brilliant, but I'm not sure that would work in many languages as arrays are almost always pass by reference, which means you're really screwing up someone's array, so in order to unscrew it you'd have to go back and make everything positive again. Now, if they said it was some data structure like an arraylist or something, that'd be different because you can pass the list object by value in many languages.
@MrSparkleDragon
@MrSparkleDragon 4 жыл бұрын
Don't use a hash set when the problem clearly states that the input array has values greater than 0 and not greater than the length of the array. Just use an N+1 array of booleans to keep track of whether or not the value n at index n has already been seen or not and iterate from left to right. Index 0 in found doesn't mean anything in this solution, but it keeps it clean. def firstDuplicate(arr): found = [False for i in range(len(arr) +1)] for n in arr: if found[n]: return n found[n] = True return -1
@Bunny501
@Bunny501 4 жыл бұрын
since numbers are limited by size of the array, I would do a second array of true/false values with the same length as the input array. storage_arr[input_arr[i] - 1] = true I mean it's not as good as your final solution, but it only adds a tiny bit of spatial complexity and should be just as fast.
@abdullahimuhammadkhalifa6695
@abdullahimuhammadkhalifa6695 3 жыл бұрын
i spent around 30 min before solving using bruteforce approach, after watching this video,,,Nick is legend
@MW_Futures
@MW_Futures 3 жыл бұрын
First thing that came into my mind was indeed a hashmap or hashset. Never ever would had thought about that thirt solution. Actually cracked my brain a little.
@embedded_software
@embedded_software 4 жыл бұрын
The third solution makes a lot of sense if you think of the array as a function mapping whole numbers to integers. In this case, the domain and range of the function are basically the same. You want to prove that the function is not one-to-one.
4 жыл бұрын
Clever! My initial instinct was to create a bool array (similar to the hashset solution, except I have more control over the space it uses) but I didn't think about using the extra bit they leave us by specifying that they use only half of the signed int space.
@hannahnelson4569
@hannahnelson4569 3 жыл бұрын
I had the same idea, solution 3 is pretty clever though. It basically fakes a bool array by using negation.
@KaramjeetSingh-vp6hp
@KaramjeetSingh-vp6hp 5 жыл бұрын
Bro your way of explaining logic is really cool. :)
@NoName-j8j3o
@NoName-j8j3o 4 жыл бұрын
The second solution is nice. More readable and enough fast.
@esbrasill
@esbrasill 4 жыл бұрын
True, the third solution does use the least memory, but the code is not self explanatory and not futureproof. What is for some reason the code needs to process negative numbers also? I in my code would not use #3
@udaykadam5455
@udaykadam5455 4 жыл бұрын
I did come up with 3rd solution myself with 15 minutes of thinking but the point is, just because you somehow saw though an unintuitive solution once doesn't mean you will always be able to. Sometimes you feel smart, sometimes you feel frustrated.
@Danicker
@Danicker 4 жыл бұрын
Doesn't the second solution still have complexity n^2? Becuase the contains() function must iterate through all the elements in the seen array. Over the length of the original array this would result in n^2 / 2 element comparisons. Only the third solution is actually linear
@solidzack
@solidzack 4 жыл бұрын
The contains() method of a hashset is O(1) in average case since it determines the position with the hashCode Method. However the collision treatment is treated with a tree set(according to internet) so in worst case its O(log(n)), which makes the whole thing O(nlog(n))
@Mojoojo95
@Mojoojo95 4 жыл бұрын
Brilliant! The third solution is essentially the second solution except you are creating the hash table. The key is the value in the array (n), the hash is n - 1, and the value is True or False (-/+).
@Benfalk1982
@Benfalk1982 4 жыл бұрын
Great walk through; I am wondering if mutability of the array is permitted? If it is the third solution is for sure a great one; but if it's not or you need to do "clean up" afterwards the Hashset is probably your best bet.
@nourgaser6838
@nourgaser6838 3 жыл бұрын
Cool! I basically came up with the second solution on my own but I didn't know what hash sets are so I used a boolean array instead. I'll have to look into hash sets...
@omkartotade5337
@omkartotade5337 4 жыл бұрын
While reading the problem statement I was like, there’s got to be some hint in why the values are always between 1 and length of the array. The third solution blew my mind!
@sridharbelide
@sridharbelide 5 жыл бұрын
Could you please let me know what you are using to illustrate the algorithm using some arrows on board??
@johnnypiquel2295
@johnnypiquel2295 4 жыл бұрын
Photoshop
@niteshbabu5731
@niteshbabu5731 4 жыл бұрын
Hey Nick, In the second solution, when you use seen.contains() method, doesn't this method too uses a loop to determine that..? So, time complexity should be O(n²), since seen.contains() is already inside a loop.
@reprogramatucerebro
@reprogramatucerebro 4 жыл бұрын
Set contains() is O(1), ArrayList contains() is O(n²)
@tuhinmukherjee8141
@tuhinmukherjee8141 4 жыл бұрын
One could come up with some sort of optimisation in that brute force approach making the observation that the program should terminate if i>=min_second_index
@ufailowell
@ufailowell 3 жыл бұрын
Wouldn't they want you to make the array back into positive integers before you returned your result? You could mess something up elsewhere changing the numbers like that.
@pritammukherjee3890
@pritammukherjee3890 4 жыл бұрын
I'm writing this without seeing the answer but it could be solved fairly easily. When we're going through the elements of the array, we push THE FIRST element we find in the given array to a STACK OR A VECTOR. Now for the next elements, we first check if the element exists in the vector already, if it does, then it's the answer. If it doesn't, we push it to the vector and continue doing it for all the elements. If no such duplicate is found in the vector, we can say there's no duplicate. This is a fairly efficient algorithm isn't it?
@mgancarzjr
@mgancarzjr 3 жыл бұрын
If it's a stack or simple vector, you'll have to iterate through each value every time you come across a new element in the array. On the second array element, you'll have to search one stack element. On the 27th array element, you'll have to search through 26 stack elements. Size of array : stack searches. 2 : 1 3 : 1 + 2 4: 1 + 2 + 3 5: 1 + 2 + 3 + 4 6: 5-array + 5 7: 6-array + 6 This is a recursive addition problem. Its time complexity is (n^2 + n) / 2. Therefore, for sufficiently large n, it's n^2 in time complexity. With a binary search tree and an array of truly random numbers, your BEST time complexity is log base 2 of n since a binary search tree, ideally, eliminates half the possibilities with each comparison: see radix search. This assumes a completely balanced tree and not a naive one which basically looks like a queue. You'd still need to search this tree n times for any array size n. Thus, you'd end up with n * log base 2 n time complexity (nlogn) in your best scenario.
@alexdang9210
@alexdang9210 4 жыл бұрын
I actually had this question asked in one of my phone screen interviews. I did the double for loop as base solution, then did the hash solution, then did the third one as sorting the array first then traverse once to find next same element (NlogN but does not take space). Then the interviews pushes for a best solution which I came up with the third solution as shown in the video. Though at the end I forgot to minus one with the index thing starting at zero, so some test cases were wrong. I did point it out but I ran out of time. I thought that was good enough since I found all four solutions with the last one just having a hiccups at the end as not having enough time to debug and fix, but I got the rejection nonetheless.
@QualifiedUmbrella
@QualifiedUmbrella 4 жыл бұрын
Wouldn't sorting it not necessarily return the right answer? In his example, if you sorted it you'd return the Lowest duplicate (2), but not the First duplicate (3)
@alexdang9210
@alexdang9210 4 жыл бұрын
@@QualifiedUmbrella Yes, sorry to clarify. In my particular problem, the version was mentioned as only one duplicate.
@kumarc4853
@kumarc4853 3 жыл бұрын
thank you very much Nick, I cracked google and linkedin. Learned a lot from your videos and explanations. Thank you again for the great content
@lovepreetsingh7033
@lovepreetsingh7033 4 жыл бұрын
How about we sort this collection of array with quick sort whose complexity is O(nlgn) and after that we can traverse the sorted array linearly to find the first duplicate number which will be almost worst case will be O(N). So it will O(nlgn) + O(n).
@prasadmascarenhas8037
@prasadmascarenhas8037 4 жыл бұрын
What if we create a list then keep adding elements and use a if(list.contians(element))?return the element
@kushalbaldev8490
@kushalbaldev8490 4 жыл бұрын
It wont work suppose you have an array like [1,3,3,1,2,3] take an empty array start adding elements first goes 1 then 3 then again 3 so checking 3 will say you that the array contains 3 and you return 3 as an answer but the answer is 1 as you need to identify the first duplicate means the occurrence of first number from the array which can be duplicated later on..!!
@sonnyakakitha2580
@sonnyakakitha2580 4 жыл бұрын
my first solution is a set < int > duplicate and for each element in array ask if value is in set, if answare is true, this is the solution, else insert the value
@augustuscaeser8939
@augustuscaeser8939 2 жыл бұрын
AMAZING! Please make more of these videos!
@mohitsoni7787
@mohitsoni7787 5 жыл бұрын
Dude your explanation is fantastic
@nGAGE0nline
@nGAGE0nline 4 жыл бұрын
Shouldn't we be able to exit the loop if i==min_second_index ? Seems unnecessary to continue looping, unless I'm missing something (it's late and I'm tired :P ) edit: oh, video's not over... there's more, lol
@croydon21H
@croydon21H 3 жыл бұрын
Sadly, didn't get the mathematical basis of the third negative number approach. Looks great when we see array size 5, but what if a few hundreds?
@narayanbhat3279
@narayanbhat3279 4 жыл бұрын
In the brute force approach, we should just return the element, the time we find a duplicate why let the loop run till n^2
@okeyD90232
@okeyD90232 3 жыл бұрын
Worst case scenario where you have n^2 calculation and there are no duplicates, or duplicates are at the end of array
@NickKravitz
@NickKravitz 4 жыл бұрын
But isn't the .contains() an O(log n) operation? As of java 8 the hashset is backed by a tree structure; traversing it to find whether the seen list contains your int value will add time complexity. So it might actually be a O(n log n) time complexity.
@johnnyb175
@johnnyb175 4 жыл бұрын
If what you're saying is true, a simple fix would be to use a regular array, A[ ], of the same size with all values initialized as 0. For each value, n, you come across in the original array, check the value at index A[n-1]. If it's 0, store n. If it's not zero, return n.
@sangramjitchakraborty7845
@sangramjitchakraborty7845 4 жыл бұрын
Just looking at the problem my first thought was to use a simple array indexed from 0 to 9. Ofcourse it won't work if the values were not single digit or if they were floats. In that case I would probably use a hashset first.
@fredV35
@fredV35 4 жыл бұрын
I wonder if the second solution is really that linear, because in fact you still have to loop through the seen numbers. But I don't much about Java.
@glad1k031
@glad1k031 4 жыл бұрын
thats not about java, but about hashset, its same in every lang
@RexZShadow
@RexZShadow 3 жыл бұрын
So I'm really curious if you get these kind of questions during like the on-site interviews? Like these easy level questions seem really easy while the medium ones seem much harder.
@deadendjesenice
@deadendjesenice 4 жыл бұрын
Seeing that we are searching for the first duplicate why do you need a minimum index? You just return it as soon as you find it and outside the for loops you return -1
@MyManJohnny
@MyManJohnny 4 жыл бұрын
You could simplify the second solution a little bit by just doing this inside the for loop: if(!seen.add(a[i])) return a[i]; Set in java returns false if it already contains the value.
@GauravBoraJodhpur
@GauravBoraJodhpur 4 жыл бұрын
If the numbers are going to be less than array length, can we create a bit set and set the bits according to values in the array? And then we traverse and if we come across a value in array which has the corresponding bit already set it is a duplicate ?
@everytimeifall
@everytimeifall 4 жыл бұрын
"Constant time means it is not gonna take any time at all" LMAO 😂🤣 10:53
@yurivarloe1526
@yurivarloe1526 3 жыл бұрын
I mean, relatively speaking
@alonamaloh
@alonamaloh 4 жыл бұрын
How is that last solution not using extra space? It uses N signs, but it cheats by using the sign bits of the given array. If the problem asked to simply return one of the numbers that appears more than once, there is a legitimate O(N) time, O(1) memory solution, but that one is really really tricky.
@Raj_Tanvar
@Raj_Tanvar 3 жыл бұрын
3rd solution is just awesome 🎉🎉
@JustAndrewIsFine
@JustAndrewIsFine 4 жыл бұрын
Would the for j = i + 1 loop make it O(n log n) because each subsequent j loop have one less check?
@DragXGaming
@DragXGaming 4 жыл бұрын
Before watching the solutions I would make a new array and basically make a copy array. As I run through the first array I add the values to the new array. I check each value to see if it's already in the array, if it's in the array it's the first duplicate.
@TJHooper123
@TJHooper123 4 жыл бұрын
That's basically what the second solution was. It's a decent solution, but you're adding space complexity to the algorithm because you're creating a new data structure to keep track of the values from the initial array. The third solution doesn't take up additional memory and instead just modifies the original array values in place.
@DragXGaming
@DragXGaming 4 жыл бұрын
@@TJHooper123 is the space added because of the array the same amount of space added with the data structure used in the video
@TJHooper123
@TJHooper123 4 жыл бұрын
DragX Gaming I don’t think so? Different data structures take up different amount of memory. A single character takes up less bytes than a hash table.
Facebook Coding Interview Question - findLongestSubarrayBySum
13:47
Amazon Coding Interview Question - Rotate Image
17:26
Nick White
Рет қаралды 245 М.
Tuna 🍣 ​⁠@patrickzeinali ​⁠@ChefRush
00:48
albert_cancook
Рет қаралды 148 МЛН
Арыстанның айқасы, Тәуіржанның шайқасы!
25:51
QosLike / ҚосЛайк / Косылайық
Рет қаралды 700 М.
Easy Google Coding Interview With Ben Awad
28:00
Clément Mihailescu
Рет қаралды 1 МЛН
Google's congressional hearing highlights in 11 minutes
10:58
Solving A Classic Google Interview Logic Puzzle
9:03
MindYourDecisions
Рет қаралды 8 МЛН
I made maps that show time instead of space
10:44
Václav Volhejn
Рет қаралды 975 М.
How I Coded An Entire Game Using ChatGPT
19:48
Nick White
Рет қаралды 122 М.
Facebook Coding Interview Question - sortedSquaredArray
12:46
Nick White
Рет қаралды 194 М.
How to: Work at Google - Example Coding/Engineering Interview
24:02
Life at Google
Рет қаралды 7 МЛН
Google Coding Interview With A High School Student
57:24
Clément Mihailescu
Рет қаралды 4,3 МЛН
Self Taught Programmers... Listen Up.
11:21
Nick White
Рет қаралды 1,1 МЛН