Find Median from Data Stream - Heap & Priority Queue - Leetcode 295

แชร์
ฝัง
  • เผยแพร่เมื่อ 29 ก.ย. 2024

ความคิดเห็น • 223

  • @richardnorth1881
    @richardnorth1881 2 ปีที่แล้ว +307

    Anyone who can come up with solutions like this, on the fly, without having first studied the problem, is a genius and I salute them. Also thank you Neetcode for this video

    • @cyliu2434
      @cyliu2434 2 ปีที่แล้ว

      no such human exists

    • @rohitkumaram
      @rohitkumaram 2 ปีที่แล้ว +42

      The only purpose of asking this kind of question in interview , probably that have you mugged up all leetcode solutions.

    • @thePontiacBandit
      @thePontiacBandit 2 ปีที่แล้ว +18

      Testing preparedness. This kind of problem solving is healthy and great but not a predictor of success as a developer.

    • @APudgyPanda96
      @APudgyPanda96 2 ปีที่แล้ว +45

      @@thePontiacBandit hard disagree. I think asking this in an interview is extremely unhealthy and a waste of time for everyone invovled

    • @thePontiacBandit
      @thePontiacBandit 2 ปีที่แล้ว +8

      @@APudgyPanda96 I agree with you. I'm trying to imagine why they would even have something like this. If I structured interviews like this, I'd be testing preparedness. It wouldn't be a sign of a good developer at all.

  • @MichaelButlerC
    @MichaelButlerC 11 หลายเดือนก่อน +14

    This is just incredible... for some reason I either never learned, or don't remember learning, Heaps in my computer science education at university. Throughout the years I have always heard the word Heap but never really drilled down into what it did or where it is useful. This really inspired me to fill the gap!!

  • @UnknownEntity606
    @UnknownEntity606 หลายเดือนก่อน +1

    Started grinding seriously as my I have a big tech interview coming up. The more obscure problems have video solutions on youtube which are horribly explained. I really appreciate how great your solutions are now, and I already loved them before. Thank you NeetCode, hopefully everything goes well!

  • @charlesmrader
    @charlesmrader 2 ปีที่แล้ว +168

    I dealt with a slightly different problem in the 1980s. I was to find the "moving median" of N points in a data sequence that keeps on coming. But I wasn't going to do this with programming, but with dedicated hardware, like gate arrays. At any instant, there were N saved points. The oldest was discarded, the sample just arrived was inserted, and the median of the N points in the list was reported. I used a min-heap and a max heap. The exciting thing was that all the comparisons and data movements in working with a heap can be done in parallel, so the O(log N) time per sample reduces to O(1).

    • @johnvonhorn2942
      @johnvonhorn2942 2 ปีที่แล้ว +40

      You're on another level, Mr Radar - God Mode.

    • @oxyht
      @oxyht 2 ปีที่แล้ว +12

      That's so fascinating.

    • @xskrish
      @xskrish 2 ปีที่แล้ว +1

      wow!!

    • @griffinoluoch7804
      @griffinoluoch7804 2 ปีที่แล้ว +1

      Hey I just solved this on leetcode..no. 480. It's the same problem?

    • @nguyen-dev
      @nguyen-dev ปีที่แล้ว +9

      haha, you started DSA since 1980s, even implementing with low-level hardware, so this channel is too easy for you, I think.

  • @dylanwu8631
    @dylanwu8631 2 ปีที่แล้ว +18

    Excellent work on explaining this!
    I was wondering if you can do a follow up video expounding upon the other approaches to the problem? IMHO, the primary reason this a leetcode hard problem is because of the myriad of approaches to solving it especially given different constraints (i.e. memory bound - how can do we do it with Reservoir Sampling, Segment Trees, etc.) I saw the Leetcode solution to this problem mentions these different approaches but I think you will do a much better job of explaining it :)

    • @lokeshnandanwar9203
      @lokeshnandanwar9203 5 หลายเดือนก่อน

      is leetcode premium worth it to see leetcode offcial solutions?

  • @akashsrivastava8324
    @akashsrivastava8324 3 ปีที่แล้ว +6

    Awesome explanation... thank you!!

    • @NeetCode
      @NeetCode  3 ปีที่แล้ว

      Thanks, much appreciated

  • @OmriSama
    @OmriSama ปีที่แล้ว +1

    Why do you do this thing where you first add to the left by default, and then check & rebalance between the two heaps? Can't you peek at the min/max in either tree and decide where it should go based on that, and based on the current sizes of each tree without having to actually go through an `add` operation?

  • @garg_krish
    @garg_krish ปีที่แล้ว +1

    Why can't I think of such amazing solutions 😢

  • @peterkim1867
    @peterkim1867 2 ปีที่แล้ว +1

    Wouldn't you get an index out of range if the first action was to find median? before adding at least two elements?
    Line 34 would run but there wouldn't be anything to return from the heaps.
    line 34: return ( -1 * self.small[0] + self.large[0]) / 2

    • @misterimpulsism
      @misterimpulsism 2 ปีที่แล้ว

      This is correct. The problem states that addNum() will be called first so index out of range won't happen. You could add a guard clause that returns 0 if both heaps are empty.

  • @estifanosbireda1892
    @estifanosbireda1892 2 ปีที่แล้ว +2

    a fantastic explanation as always, Thanks!

  • @shantanusharma184
    @shantanusharma184 2 ปีที่แล้ว +1

    Best explanation. Thank you

  • @chloe3337
    @chloe3337 2 ปีที่แล้ว +5

    heya! im not sure why the solution works for you, but i had to divide by 2.0 instead of 2 because of some floating error - took very long to debug this :/ had to read through the discuss section in leetcode to find some answers - but thanks for doing this!

    • @nks36
      @nks36 2 ปีที่แล้ว

      Can you link it here?

    • @justinlawrence5611
      @justinlawrence5611 2 ปีที่แล้ว

      Thanks for this I was wondering why mine was working too. This fixed it.

  • @karthik829
    @karthik829 3 ปีที่แล้ว +3

    Thanks!

    • @NeetCode
      @NeetCode  3 ปีที่แล้ว

      Thank you so much, I really appreciate it! 😊

  • @halflearned2190
    @halflearned2190 2 ปีที่แล้ว

    Very nicely presented, thanks!

  • @akshaychavan5511
    @akshaychavan5511 7 หลายเดือนก่อน

    Came up with this solution without watching the coded solution -
    class MedianFinder:
    def __init__(self):
    self.maxHeap = [] # lower half of the sorted array
    self.minHeap = [] # upper half of the sorted array
    def addNum(self, num: int) -> None:
    heappush(self.maxHeap, -num)
    # if all elements in maxHeap are not smaller than all elements in meanHeap
    if self.minHeap and self.maxHeap and (-self.maxHeap[0]) > self.minHeap[0]:
    heappush(self.minHeap, -heappop(self.maxHeap))
    # check if difference in size of min heap and max heap is within allowable limit
    sizeDiff = len(self.maxHeap) - len(self.minHeap)
    if sizeDiff>1: # maxHeap has 1+ extra elements
    heappush(self.minHeap, -heappop(self.maxHeap))
    elif sizeDiff < 0: # minHeap has extra elements
    heappush(self.maxHeap, -heappop(self.minHeap))
    def findMedian(self) -> float:
    if (len(self.minHeap) + len(self.maxHeap))%2==0: # even size
    return (self.minHeap[0] - self.maxHeap[0])/2
    else:
    return -self.maxHeap[0]

  • @niteshmodi5468
    @niteshmodi5468 ปีที่แล้ว

    Great... Although you code in python, which is above my head, thanks for the intuition though;

  • @edwardteach2
    @edwardteach2 3 ปีที่แล้ว +1

    U a God

  • @NianlongLin-hy4jy
    @NianlongLin-hy4jy ปีที่แล้ว

    thank you

  • @tanayshah275
    @tanayshah275 3 ปีที่แล้ว +1

    so maximum time complexity for add operation will be : 3 log N , correct ?

  • @vaidehidharkar3348
    @vaidehidharkar3348 2 ปีที่แล้ว

    What is time complexity for this solution?

  • @Dipenparmar12
    @Dipenparmar12 ปีที่แล้ว

    How can we create a heap in Js.

  • @yonasstephen
    @yonasstephen 3 ปีที่แล้ว

    I wonder why not use a binary search instead? Since we can build the list sorted, we can binary search to find the position to insert new num in O(logn)? Then retrieve is just the same as in-order insertion which is O(1). Is it because inserting an item into an array itself takes O(n) due to copy operation in the underlying data structure?

    • @Rahul-pr1zr
      @Rahul-pr1zr 3 ปีที่แล้ว +1

      I coded this using a binary search based approach. The downside is that elements will need to be shifted right once you get the position and that can potentially be O(n).

    • @sravanikatasani6502
      @sravanikatasani6502 3 ปีที่แล้ว

      we got to shift elements again right!! thats O(n)

  • @ke5684
    @ke5684 2 ปีที่แล้ว

    haven’t done this problem yet saw title of this video suggested by youtube. what s massive spoiler tony stark dies in endgame

  • @Grim_tidings
    @Grim_tidings 2 ปีที่แล้ว +20

    I'm a bit confused on why to add the new numbers directly to small heap by default. Couldn't this be optimized by comparing the incoming number to both of the root nodes and adding the new one to the small/large heap depending on if the new value is less than or greater than the root nodes?
    Like let's take 3547:
    First add 3 to the small heap - O(1)
    Next 5 comes in, is greater than 3, add to large heap - O(1)
    Next 4 comes in, is between the values, add to small heap - O (log n)
    Next 7 comes in, is larger than 4 and larger than 5, add to large heap - O (log n)
    if that last number was a 1, its smaller than 4 and 5, add to small heap. Now the heaps are unbalanced so pop 4 from small heap, push to large heap - O (log n) + O (log n)
    This way whenever your trees are imbalanced you can just pop the root of the larger one and push it to the smaller one to keep them always balanced and at most you have to push and pop once rather than twice with the "swap" method in the video.
    I may be missing something, but it seems inefficient at first glance.
    Other than that, great video, you always explain these so concisely 👍

    • @pekarna
      @pekarna 2 ปีที่แล้ว

      You're right.

    • @dumdumbringgumgum2940
      @dumdumbringgumgum2940 2 ปีที่แล้ว +1

      i may have figured out why.. in case either of the 2 heaps keeps increasing due to the value we keep pushing and it becomes too large. We are already making a swap why not do it without all the lookup.

    • @AbdelrahmanMoussaAbuOuf
      @AbdelrahmanMoussaAbuOuf 2 ปีที่แล้ว +1

      @@dumdumbringgumgum2940 You are correct, but remember the lookup is merely an O(1) and it can save you a couple of O(log n) operations in case the numbers are sort of balanced such as in the example

    • @jeffli5017
      @jeffli5017 2 ปีที่แล้ว +3

      totally!! I guess he just wanted to to make the code cleaner and easier to understand.

    • @shubhampatel7870
      @shubhampatel7870 หลายเดือนก่อน

      Sounds right to me.. instead of always adding to first heap.. deciding on which one to add will save some rebalancing steps.

  • @bernhardhausleitner60
    @bernhardhausleitner60 2 ปีที่แล้ว +8

    I had this in an interview and was asked:
    - "If all integer numbers from the stream are in the range [0, 100], how would you optimize your solution?"
    - "If 99% of all integer numbers from the stream are in the range [0, 100], how would you optimize your solution?"
    How would you go about this? :) Thanks!

    • @PippyPappyPatterson
      @PippyPappyPatterson 2 ปีที่แล้ว

      bump, I'm also curious how one would do this.
      Did you give any answers or did you intervieweres give any response Bernhard?
      My only thought was you could use an 8-bit integer or float.

    • @mongoose2014
      @mongoose2014 2 ปีที่แล้ว +12

      Maybe initialise an array of size 100, where the key is the number and the value is the frequency. You can then get the answer in constant time by iterating through until you reach the halfway point.
      If 99% of the numbers are in the range you could do the same thing but have two extra variables for "over 100" and "under 100" and this would work as long as the median number was in the range 0-100

    • @PippyPappyPatterson
      @PippyPappyPatterson 2 ปีที่แล้ว

      @@mongoose2014 wow that's great. Had you done something like that on a previous problem? Very clever of you.

    • @TheMadRunner00
      @TheMadRunner00 ปีที่แล้ว

      Yes, as @mongoose2014 said, one can use set/dict approach to solve this.

  • @protyaybanerjee5051
    @protyaybanerjee5051 3 ปีที่แล้ว +22

    Very nice visual explanation. I would suggest, that instead of saying size is approximately equal, can we say size difference must be at MOST 1 ?

    • @AsliArtistVlogs
      @AsliArtistVlogs 2 ปีที่แล้ว +2

      Yes if its not, simply remove from whichever is larger and insert into the other.

  • @JonathanBatchelder
    @JonathanBatchelder 2 ปีที่แล้ว +22

    10:18
    But two, that's "two" big of a difference 😁
    Thank you NeetCode, your videos have helped me understand hard problems a bunch!

  • @aaron10k
    @aaron10k 6 หลายเดือนก่อน +2

    bruh you need to start rehearsing your videos. sloppy ahh code

  • @maniskarki
    @maniskarki ปีที่แล้ว +3

    A bit shorter and easier:
    import heapq
    class MedianFinder:
    def __init__(self):
    self.min_heap = [] # To store the larger half of numbers
    self.max_heap = [] # To store the smaller half of numbers
    def addNum(self, num: int) -> None:
    # Push the number to the max-heap (negated) to simulate a min-heap
    heapq.heappush(self.max_heap, -num)
    # Pop the smallest number from the max-heap and push it to the min-heap
    heapq.heappush(self.min_heap, -heapq.heappop(self.max_heap))
    # If the min-heap has more elements than the max-heap, balance the heaps
    if len(self.min_heap) > len(self.max_heap):
    heapq.heappush(self.max_heap, -heapq.heappop(self.min_heap))
    def findMedian(self) -> float:
    # If the total count of numbers is odd, the median is the top of the max-heap
    if len(self.max_heap) > len(self.min_heap):
    return -self.max_heap[0]
    # If the total count of numbers is even, the median is the average of the tops of both heaps
    return (-self.max_heap[0] + self.min_heap[0]) / 2

  • @memevideos7461
    @memevideos7461 หลายเดือนก่อน +1

    how do you recognize that you can use 2 heaps instead of an array + binary search

  • @ranaafifi5487
    @ranaafifi5487 ปีที่แล้ว +4

    Thank you for this great explanation! I was wondering why can't we use a self-balancing BST like AVL or Red Black Tree ?

    • @karanshah2283
      @karanshah2283 9 หลายเดือนก่อน

      You can definitely use them but heaps are more optimal

  • @kmichal456
    @kmichal456 ปีที่แล้ว +2

    Is this not the same as using a single heap?

  • @KeshavKumar69420
    @KeshavKumar69420 3 ปีที่แล้ว +6

    Does using heapq._heapify_max a standard for implementing a max heap or we need to use -1 with min heap?

    • @play005517
      @play005517 ปีที่แล้ว

      no, first, this is not a standard method that subject to be changed without any notice. Second, it will introduce bugs if used outside the implementation of heapq.nsmallest
      It's an implementation specific method alongside heapq._heappop_max and heapq._heappush_max used to aid the implementation for heapq.nsmallest
      They will not maintain proper max heap structure when used in ways other than inside heapq.nsmallest

  • @AlfredPros
    @AlfredPros หลายเดือนก่อน +1

    Here's a cheat solution with fast performance.
    from sortedcontainers import SortedList
    class MedianFinder:
    def __init__(self):
    self.list_num = SortedList()
    def addNum(self, num: int) -> None:
    self.list_num.add(num)
    def findMedian(self) -> float:
    n = len(self.list_num)
    if n % 2:
    return self.list_num[n//2]
    return (self.list_num[n//2] + self.list_num[n//2-1])/2

  • @asdfgsf9660
    @asdfgsf9660 ปีที่แล้ว +1

    Even tho this is a hard problem and I have trouble with mediums and even some easy problems, I was able to get this one pr easily. So I guess everyone has a type of problem that just clicks. No point beating yourself up over not getting all of the patterns immediately.

  • @EranM
    @EranM 4 หลายเดือนก่อน +1

    the addNum function should be SO MUCH simpler. You don't need to check anything if you keep everything on track.
    if "small" is larger, push element to there and pop from there and push it to "bigger", VICE VERSA.
    The heaps themselves will make the work of whos bigger/smaller.

  • @piyushupadhyay8361
    @piyushupadhyay8361 2 ปีที่แล้ว +3

    thanks neetcode for amazing explanation !!!!.....
    you are doing a great job man.....there are a few youtube channels which implement problem code in python...YOU ARE THE BEST OF THEM 👑

  • @houmankamali5617
    @houmankamali5617 2 ปีที่แล้ว +12

    An alternative solution could be to use self-balancing trees (AVL, Red-Black, etc.) where all the operations are log_n. However, we don't need find and instead just need to maintain a separate pointer to the median, then after each insert, depending on whether the inserted number is greater or less the current median, we move the median pointer to the element adjacent to the previous median.

  • @yashshukla1637
    @yashshukla1637 14 วันที่ผ่านมา

    class MedianFinder:
    def __init__(self):
    self.small = [] # the smaller half of the list, max heap (invert min-heap)
    self.large = [] # the larger half of the list, min heap
    def addNum(self, num):
    if len(self.small) == len(self.large):
    heappush(self.large, -heappushpop(self.small, -num))
    else:
    heappush(self.small, -heappushpop(self.large, num))
    def findMedian(self):
    if len(self.small) == len(self.large):
    return float(self.large[0] - self.small[0]) / 2.0
    else:
    return float(self.large[0])

  • @paulchino81
    @paulchino81 3 ปีที่แล้ว +6

    Thanks for making this clear!!!

  • @austinyu9839
    @austinyu9839 2 ปีที่แล้ว +3

    Hi, I have a question about this question. Can I use binary search to do insertion sort when adding a new number? This will also be a O(logn) algorithm

    • @peterparker892
      @peterparker892 2 ปีที่แล้ว +6

      But only to find the insertion index. To actually insert it would still be O(n) because you potentially have to move/shift every item in the list.

    • @austinyu9839
      @austinyu9839 2 ปีที่แล้ว

      @@peterparker892 Make sense! Thank u so much!

  • @sdsunjay
    @sdsunjay 11 หลายเดือนก่อน

    this code runs faster:
    ```
    class MedianFinder:
    def __init__(self):
    self.small = [] # the smaller half of the list, max heap (invert min-heap)
    self.large = [] # the larger half of the list, min heap
    def addNum(self, num):
    if len(self.small) == len(self.large):
    heappush(self.large, -heappushpop(self.small, -num))
    else:
    heappush(self.small, -heappushpop(self.large, num))
    def findMedian(self):
    if len(self.small) == len(self.large):
    return float(self.large[0] - self.small[0]) / 2.0
    else:
    return float(self.large[0])
    ```

  • @yogyajain-e9h
    @yogyajain-e9h 3 วันที่ผ่านมา

    I think that the heap solution is not much intuitive but thanks neetcode for such good explanation

  • @nikhilaradhya4088
    @nikhilaradhya4088 ปีที่แล้ว +1

    This algorithm should be in Bible

  • @Tribalchief69690
    @Tribalchief69690 3 หลายเดือนก่อน

    I have 0 knowledge of dsa or oop yet i think this is the easiest leetcode problem there ever exists.
    I can do this only using C !

  • @alaningersoll9483
    @alaningersoll9483 2 หลายเดือนก่อน

    Any reason why insertion via binary search wouldn't work in the same time complexity?

  • @mwave3388
    @mwave3388 2 ปีที่แล้ว +1

    The solution is understandable, but it would take weeks of full time work for me, to find it by myself. As every test, mock code verifies your knowledge of patterns.

  • @TechPeck-zm9pe
    @TechPeck-zm9pe 6 หลายเดือนก่อน

    Instead of a heap can't we maintain a simple list. For every insert we add the element in the sorted order using binary search.

  • @AnandKumar-kz3ls
    @AnandKumar-kz3ls ปีที่แล้ว +1

    whenever i came to see your approch i just had to watch 1/3 of the video cuz your explanation is soo good

  • @dorondavid4698
    @dorondavid4698 2 ปีที่แล้ว +1

    Don't complain about Python Min Heaps...C# doesn't support ANY heaps out of the box at all :(

    • @diegosps
      @diegosps 2 ปีที่แล้ว +2

      Since .net 6 there is a PriorityQueue implementation inside the framework: docs.microsoft.com/en-us/dotnet/api/system.collections.generic.priorityqueue-2?view=net-6.0

  • @tejakovvuri3042
    @tejakovvuri3042 3 หลายเดือนก่อน

    Why didn't you use -1* in find median def while calculating median ?

  • @rajshah9129
    @rajshah9129 9 หลายเดือนก่อน

    How to come up that we require 2 priority queue one of min and one of max pls give intuition also

  • @Voidwanderer571
    @Voidwanderer571 7 หลายเดือนก่อน

    Hello sir could you please explain why sometimes it's self.small but sometimes it's just small?? I'm really confused, thank you!

    • @OGKix
      @OGKix 7 หลายเดือนก่อน

      He fixes it in the end for all of them to have the self.

  • @ChocolateMilkCultLeader
    @ChocolateMilkCultLeader 2 ปีที่แล้ว +2

    Great explanation. Couldn't heaps be avoided in this case? We can insert the values from our stream into our list in log time by using binary search (by definition empty and lists of size 1 are always sorted) and then find the median that way. Am I missing something?

    • @dadisuperman3472
      @dadisuperman3472 2 ปีที่แล้ว

      N log N

    • @dadisuperman3472
      @dadisuperman3472 2 ปีที่แล้ว

      Yes.
      For 1 elem -> log1
      2->log2
      3->log3
      ...etc
      N->logN
      Do the sum:
      Log1+log2+log3+...+logn=log(1×2×3×4×..×n)=log(n!)~nlogn
      So it is
      NlogN

    • @mirkowaechter
      @mirkowaechter 2 ปีที่แล้ว

      @@dadisuperman3472 isn't that the same for the heap solution?

    • @dadisuperman3472
      @dadisuperman3472 2 ปีที่แล้ว

      @@mirkowaechter yes almost.
      The video's solution is 3*NlogN which at infinity is equivalent to NlogN.
      But sorted insertion as in the question above, is exactly NlogN

    • @atd233
      @atd233 2 ปีที่แล้ว

      @@dadisuperman3472 the video solution is O(logn), not O(nlogn).

  • @zach2252
    @zach2252 2 หลายเดือนก่อน

    I think it would be way easier to solve with if len(self.small) == len(self.large) instead

    • @kvtys
      @kvtys 7 วันที่ผ่านมา

      i think you must've fell asleep while watching

  • @MsSkip60
    @MsSkip60 3 ปีที่แล้ว +6

    Great stuff, thanks mate! I've been thinking off migrating to Python for interviews but the edge case with max heap again made me stop doing that :)

    • @tengamangapiu
      @tengamangapiu 3 ปีที่แล้ว

      Good point, the language used should add more time ( as there's usually more compilation errors going from Python to Java to say C++ )

  • @Akash0515
    @Akash0515 9 หลายเดือนก่อน

    I Don't understand when we are chcking if every num in small is

  • @thomasmoore6963
    @thomasmoore6963 5 หลายเดือนก่อน

    Got asked this during a 20 minute technical interview for a FAANG internship. Got the brute force solution but didn't get anywhere close to optimal. Safe to say I didn't do well 😔

  • @Narendra_Trump
    @Narendra_Trump 2 ปีที่แล้ว

    Hey
    Great explaination!
    Why is this code not working: Can you please help!
    Thanks in Advance.
    class MedianFinder {
    public:
    vector arr;
    MedianFinder() {
    arr.clear();
    }
    void addNum(int num) {
    arr.push_back(num);
    }
    double findMedian() {
    int n = arr.size();
    std::nth_element(arr.begin(), arr.begin()+n/2, arr.end());
    std::nth_element(arr.begin(), arr.begin()+(n/2-1),arr.end());
    if(n&1) return arr[n/2];
    return ((arr[n/2-1]+arr[n/2])/2);
    }
    };

  • @MichaelShingo
    @MichaelShingo ปีที่แล้ว

    not too bad! I thought I had to implement my own heap

  • @fruitshopowner2505
    @fruitshopowner2505 4 หลายเดือนก่อน

    Man you're actually so goated. These explanations legitimely make me understand the structures and concepts and I can code out the solution from understanding the logic before you go through writing the code. While I'm sure Google was great, I'm glad you're focusing full time on this because you're the absolute best resource I've come across

  • @debpriyaseal3538
    @debpriyaseal3538 ปีที่แล้ว

    Shouldn't the return self.small[0] should be -1* self.small[0] in the findMedian method. As we are artificially adding them for our purposes.

  • @alperefesahin
    @alperefesahin 23 วันที่ผ่านมา

    did not like that question..

  • @tinymurky7329
    @tinymurky7329 ปีที่แล้ว

    Speed will be a little be faster if we check item is large or equal to self.large[0], then decide which heap to add
    import heapq
    class MedianFinder:
    def __init__(self):
    self.small = [] # maxheap
    self.large = [] # minheap
    heapq.heapify(self.small)
    heapq.heapify(self.large)
    def addNum(self, num: int) -> None:
    if (self.large and (self.large[0]) 1:
    temp = -1 * heapq.heappop(self.small)
    heapq.heappush(self.large, temp)
    if len(self.small) - len(self.large) < -1:
    temp = -1 * heapq.heappop(self.large)
    heapq.heappush(self.small, temp)
    def findMedian(self) -> float:
    if len(self.small) > len(self.large):
    return -1 * self.small[0]
    if len(self.large) > len(self.small):
    return self.large[0]
    return (-1 * self.small[0] + self.large[0])/2

  • @jacobp6891
    @jacobp6891 ปีที่แล้ว

    I don't understand why line 16 doesn't output "IndexError: list index out of range" when he tries to get the first index of the empty large heap. What am I missing?

  • @yaswanthkosuru
    @yaswanthkosuru 2 ปีที่แล้ว +1

    we can also do with sortedlist from sorted containers

  • @tesfalemeshetu4675
    @tesfalemeshetu4675 2 หลายเดือนก่อน

    for anyone looking for a brief solution check this out :
    class MedianFinder:
    def __init__(self):
    self.heap = [] , []

    def addNum(self, num: int) -> None:
    left , right = self.heap
    heapq.heappush(left , -heappushpop(right , num))
    if len(right) < len(left):
    heapq.heappush(right , -heapq.heappop(left))

    def findMedian(self) -> float:
    left, right = self.heap
    if len(right) > len(left):
    return float(right[0])
    else:
    return float(-left[0]+ right[0])/2

  • @mercurialidea
    @mercurialidea 11 หลายเดือนก่อน

    For some interesting reason it runs a bit slower, if we add incoming num into the big heap instead of small, and make the necessary amendment to the code to pop from big instead of small.

  • @reinforcer9000
    @reinforcer9000 2 ปีที่แล้ว +5

    question: if you're doing a bunch of logn operations for every add, isn't this a nlogn algorithm? if you're adding n elements, and each time you'll do logn operations, this becomes nlogn. you may as well have just sorted it.

    • @dadisuperman3472
      @dadisuperman3472 2 ปีที่แล้ว

      Well spotted

    • @sasageyo9571
      @sasageyo9571 2 ปีที่แล้ว +7

      sorting 1 time takes nlogn , sorting n times will take (n^2)log(n) :)

  • @symbol767
    @symbol767 2 ปีที่แล้ว

    Jesus, on an interview I have no idea how they expect us to come up with this if we never saw this heap solution before. I'm just gonna give them the in-order sorting solution and talk to them about the 2 heap solution if I ever get this, cause this is tough to remember.

  • @lethanhminh8443
    @lethanhminh8443 ปีที่แล้ว

    instead of using heap or priority queues. You can use segment tree . Let me explain , each node from l to r represents the position where each value in the segment l , r lies in the sorted array . Then you can use lazy update to update the position and binary search in segment tree to find the median :"D have fun.

  • @manavjain8805
    @manavjain8805 2 ปีที่แล้ว +1

    What is the time complexity of this solution ?

  • @sauravchandra10
    @sauravchandra10 ปีที่แล้ว

    I believe in C++, we cant delete arbitrary elements from heap, so we would have to modifications in this approach.

  • @hinocenciopaulo
    @hinocenciopaulo 8 หลายเดือนก่อน

    that's fucking beautiful

  • @venkatasriharsha4227
    @venkatasriharsha4227 3 ปีที่แล้ว +1

    Thanks for making this look easy and Isn't that O(n) as we are finding lengths of small and large?

    • @PavelBogart
      @PavelBogart 2 ปีที่แล้ว +1

      len() function in Python runs in O(1) complexity

  • @daumtto
    @daumtto 9 หลายเดือนก่อน

    Your explaining is good as f

  • @tesfalemeshetu4675
    @tesfalemeshetu4675 2 หลายเดือนก่อน

    Thank you Neetcode for the amazing job you are doing.

  • @shooby117
    @shooby117 2 ปีที่แล้ว

    Had to make it a 2.0 instead of 2 because of some bullshit rounding error at the end

  • @srikanthraj7413
    @srikanthraj7413 8 หลายเดือนก่อน

    Great Explanation! - Question: Why can't we find the index to insert the element to array list using Binary Search and add it to that index as we go? Is it because insertion to a particular index in ArrayList is O(n) operation?

    • @akshaychavan5511
      @akshaychavan5511 7 หลายเดือนก่อน

      Yup, that's correct! You will need to shift the elements so it will be O(n).

  • @srijeetful
    @srijeetful 2 ปีที่แล้ว +1

    very well explained !!!

  • @PippyPappyPatterson
    @PippyPappyPatterson 2 ปีที่แล้ว

    I kept fucking up the `* -1` to use a Minimum Heap as a Maximum Heap, so I just made little functions to do it for me. Posting it in case someone else finds the approach helpful:
    ```python3
    def maxPush(heap: list[float], num: float) -> None:
    heapq.heappush(heap, num * -1)
    def maxPop(heap: list[float]) -> float:
    return heapq.heappop(heap) * -1
    def maxPeak(heap: list[float]) -> float:
    return heap[0] * -1
    class MedianFinder:
    def __init__(self):
    self.small = []
    self.large = []
    def addNum(self, num: int) -> None:
    s, l = self.small, self.large
    maxPush(s, num)
    # Ensure heap equality
    if (s and l and maxPeak(s) > l[0]
    ):
    val = maxPop(s)
    heapq.heappush(l, val)
    # Ensure heap sizes within 1
    if len(s) > len(l) + 1:
    val = maxPop(s)
    heapq.heappush(l, val)
    if len(s) + 1 < len(l):
    val = heapq.heappop(l)
    maxPush(s, val)
    def findMedian(self) -> float:
    s, l = self.small, self.large
    if len(s) > len(l):
    return maxPeak(s)
    elif len(s) < len(l):
    return l[0]
    return (maxPeak(s) + l[0]) / 2
    ```

    • @PippyPappyPatterson
      @PippyPappyPatterson 2 ปีที่แล้ว

      Okay, after implementing these functions I got better at just doing the `* -1` instead. But, if you're getting tripped up by it, I still recommend it.

  • @tanayshah275
    @tanayshah275 3 ปีที่แล้ว +1

    Best Explanation!

  • @nguyen-dev
    @nguyen-dev ปีที่แล้ว +3

    Does anyone know how to answer the follow-up questions?
    1. If all integer numbers from the stream are in the range [0, 100], how would you optimize your solution?
    I think we can replace 2 Heaps with 2 TreeMaps. The key is the number, value is the count. We need to keep the sum of values of 2 TreeMaps and update each time we balance size of the 2 TreeMaps.
    Because the we have 100 keys => time is O(1) and space is also O(1)
    2. If 99% of all integer numbers from the stream are in the range [0, 100], how would you optimize your solution?
    I think 2 TreeMaps still work without changing in this case. But time and space complexity is the same as the original problem.
    What do you think?

    • @muhammadsajawalsial8959
      @muhammadsajawalsial8959 ปีที่แล้ว +2

      I think you can do this with a simple array. Store frequency of each number. In the getMedian() function, loop through the array and find the index where commulative frequency is equal to (number_of_elements/2).
      In case only 99% numbers are between 1-100, the median will definitely fall between 1-100. In such a case, store count of numbers lesser than 1 and greater than 100 and find the element at index (total_no_of_elements/2).
      Two things I'd like to point out:
      1) the above methods are for odd number of elements. Even no of elements just require a simple check.
      2) I am horrible at this so I can't be certain. I tried but if you think I am wrong, I'd be happy to hear.

  • @hoyinli7462
    @hoyinli7462 3 ปีที่แล้ว

    why you dun use sortedcontainers? is it we are not allowed to use this library?

  • @il5083
    @il5083 ปีที่แล้ว

    Binary search actually works for this problem.

  • @Pithukuly
    @Pithukuly 4 หลายเดือนก่อน

    this is amazing

  • @amanimagdi150
    @amanimagdi150 3 หลายเดือนก่อน

    perfect solution and illustration, thanks so much

  • @zifanxu522
    @zifanxu522 3 ปีที่แล้ว +2

    Thanks!

    • @zifanxu522
      @zifanxu522 3 ปีที่แล้ว

      wow, it's automatic. Thanks a lot for all the awesome videos, hope you enjoy your my coffee and have a great weekend!

    • @NeetCode
      @NeetCode  3 ปีที่แล้ว

      Thank you so much, I really appreciate it! 😊

    • @zifanxu522
      @zifanxu522 3 ปีที่แล้ว +2

      @@NeetCode Your videos lift my python skill to the next level. Will let you know when I get my first offer! Appreciate it!

  • @hwang1607
    @hwang1607 8 หลายเดือนก่อน

    i enjoyed

  • @pl5778
    @pl5778 ปีที่แล้ว

    best explanation ever! thank you so much for this.

  • @kritmok1875
    @kritmok1875 11 หลายเดือนก่อน +2

    Thanks for the explanation!
    I found the below solution a bit cleaner yet still intuitive.
    The idea is that for every new number,
    1. we first push it to the small heap
    2. then pop from small heap and push to big heap
    3. check if the small heap is too small(not balanced)
    the code can be reduced largely in this case:
    class MedianFinder:
    def __init__(self):
    self.maxHeap = []
    self.minHeap = []
    def addNum(self, num: int) -> None:
    heapq.heappush(self.maxHeap, num * -1)
    heapq.heappush(self.minHeap, heapq.heappop(self.maxHeap) * -1)
    if len(self.minHeap) > len(self.maxHeap):
    heapq.heappush(self.maxHeap, heapq.heappop(self.minHeap) * -1)
    def findMedian(self) -> float:
    if len(self.maxHeap) == len(self.minHeap):
    return (self.minHeap[0] + self.maxHeap[0] * -1) /2
    else:
    return self.maxHeap[0] * -1

  • @ibtesamahmed7612
    @ibtesamahmed7612 2 ปีที่แล้ว

    isn't heappop() an O(logn) operation, since the heap has to be rearranged later, instead of O(1) ?

  • @swethasekhar9045
    @swethasekhar9045 3 หลายเดือนก่อน

    Thanks!

  • @laumatthew71
    @laumatthew71 ปีที่แล้ว

    amazing explanation, thank you very much sir !

  • @aishwaryaranghar3385
    @aishwaryaranghar3385 2 ปีที่แล้ว

    Thank You for the amazing explanation Neetcode

  • @krateskim4169
    @krateskim4169 ปีที่แล้ว

    i like the way you think ,its impressive

  • @urrahman196
    @urrahman196 2 ปีที่แล้ว

    anyone can tell me that , at 23:16, is there any case where the input will enter into line no 24? if so what could be that sample input?

    • @taymurnaeem50
      @taymurnaeem50 ปีที่แล้ว

      Go to 12:51, it will answer your query.

  • @deep87
    @deep87 2 ปีที่แล้ว

    Why can't we just use a single heap?

  • @artyquantum7283
    @artyquantum7283 2 ปีที่แล้ว

    Cant we do it in set in which insert in sorted order and will be less painful than using two heaps. max insert complexity is still log(n) and median will also be O(1)

    • @thewatcherlollol
      @thewatcherlollol 2 ปีที่แล้ว

      Sets don't insert in sorted order...

    • @artyquantum7283
      @artyquantum7283 2 ปีที่แล้ว

      @@thewatcherlollol Are you sure ?
      Check difference between
      and in C++. if a certain language does not have an ordered set then it does not mean it does not exist. You can always make one if a language's std library does not have it.

  • @jongxina3595
    @jongxina3595 2 ปีที่แล้ว

    cant u use an order statistic tree for this?