Property-Based Testing In Python: Hypothesis is AWESOME

แชร์
ฝัง
  • เผยแพร่เมื่อ 12 ม.ค. 2025

ความคิดเห็น • 95

  • @ArjanCodes
    @ArjanCodes  ปีที่แล้ว +1

    💡 Get my FREE 7-step guide to help you consistently design great software: arjancodes.com/designguide.

  • @Mutual_Information
    @Mutual_Information 2 ปีที่แล้ว +41

    Excellent vid - much clearer than the documentation. One feature worth emphasizing is the automatic testing code generation. That’s the big gun for creating a ton of test coverage.

  • @robertbrummayer4908
    @robertbrummayer4908 2 ปีที่แล้ว +24

    I told you property-based testing is awesome :). Great video, Arjan! One approach I like a lot is to use a test oracle. When I have an optimized and therefore complicated implementation of an algorithm, I often also implement a simple version of the algorithm which is easy to understand, but not optimized and not efficient. I use Hypothesis to generate random input, run both implementations, and then assert that they return the same result.

  • @THEMithrandir09
    @THEMithrandir09 2 ปีที่แล้ว +14

    What would be really cool would be a comparison video, first write a small project (idk, sudoku, battleship, whatever) and then write appropriate unit tests for it, and then do it again but using test-driven-development. To me it was hugely interesting how it influenced the way I design my code and how sensible the coupling between tests and code almost automatically becomes.

  • @EndlessPH
    @EndlessPH 2 ปีที่แล้ว +11

    The amount of bugs hypothesis has uncovered on projects I've worked with makes learning it worthwhile

  • @robertbrummayer4908
    @robertbrummayer4908 2 ปีที่แล้ว +3

    Another awesome feature of property-based testing and also Hypothesis is that the failure-inducing input is automatically simplified to give you a small and simple counter example. This is beneficial for debugging.

  • @drooten
    @drooten 2 ปีที่แล้ว +6

    You bring up an interesting capability and much misunderstood feature of Python; Decorators.
    Perhaps this could be a topic for the future?
    Thanks for sharing!

  • @aflous
    @aflous 2 ปีที่แล้ว +5

    Excellent video! I am using pytest quite often but didn't know about hypothesis. This would take my tests to the next level! Keep these videos coming, Arjan

  • @TonyFlexPromo
    @TonyFlexPromo 2 ปีที่แล้ว +2

    Thank you so much for this video. I also had an excellent experience with hypothesis. You covered almost every case, probably except using map and flatmap to generate complex strategies. We found lots of very strange bugs thanks to this framework. Also property based testing is kinda mind blowing at first, but when it becomes a second nature, you even write regular tests much better :)
    Thank you so much for all of your videos, i enjoy every second of it. Best coding channel on youtube ❤

  • @maephisto
    @maephisto 2 ปีที่แล้ว +3

    Didn't know about Hypothesis but I was aware of property-based testing (i.e. verifying a property of the system more than asserting a specific response for a given input). In my experience I was using a combination of unitest and Faker: it seems this approach would be more structured and synthetic. Definitely worth to try in one of my next projects.

  • @mrswats
    @mrswats 2 ปีที่แล้ว +7

    I love hypothesis! It's an awesome package and I think it should be used much more. However I recognise that it can be tricky to know when to use it if you are used to regular ol' unit tests.

  • @ApsJoo
    @ApsJoo 2 ปีที่แล้ว

    I had never heard of property testing and found the subject very very interesting. I would really appreciate more videos on it!

  • @VladimirTheAesthete
    @VladimirTheAesthete 2 ปีที่แล้ว

    This is way better than what I could gather from documentation, kudos!

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว

      Thanks so much Vladimir, glad it was helpful!

  • @Thehonorablesunpresiding
    @Thehonorablesunpresiding 7 หลายเดือนก่อน

    Thank you for this! I was already kinda doing this, but hypothesis will make the tests clearer and easier to make.

    • @ArjanCodes
      @ArjanCodes  7 หลายเดือนก่อน

      You’re welcome! ☺️

  • @lecume
    @lecume 2 ปีที่แล้ว

    Great video, congrats and keep posting!

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว +1

      Thanks so much Javier, glad you liked it!

  • @betawolf2095
    @betawolf2095 2 ปีที่แล้ว

    Nice, Arjan. I like your clear way of explaining sometimes complicated concepts. You taught me Python development the right way!

  • @MedievalChips
    @MedievalChips 2 ปีที่แล้ว +1

    Love the videos Arjan keep them coming.. Can't wait every time!

  • @lunalect
    @lunalect 2 ปีที่แล้ว +1

    Great video! I’d love it if you could do a vid on unit testing, integration testing and data quality checking for a data engineering shop. I know it’s not exactly your wheel house, but curious what you’d come up with.

  • @kevon217
    @kevon217 ปีที่แล้ว

    The back lighting in your videos are 👌

  • @hcubill
    @hcubill 2 ปีที่แล้ว

    Excellent video would love to hear more about hypothesis!

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว

      On it, Hector ;).

  • @homayoonalimohammadi9078
    @homayoonalimohammadi9078 2 ปีที่แล้ว

    Your videos are simply awesome. Please provide us with more valuable information specially about hypothesis in depth (as u mentioned).
    Keep up the good work! ✌🏻

  • @MrEloska
    @MrEloska 2 ปีที่แล้ว

    Very good video! I saw hypothesis package one time in our company, but I didn't have time to dive into it. After that vid I'll back there for sure :D

  • @atkinpaul
    @atkinpaul 2 ปีที่แล้ว +1

    Thanks! Very nice. Feels like hypothesis with the faker package could be an elegant way to create, test & stub data.

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว

      You’re welcome! Indeed, that’s an interesting combination to explore

  • @mikeciul8599
    @mikeciul8599 2 ปีที่แล้ว +1

    The thing that blew my mind about QuickCheck is that when it finds a failing example it shrinks it down to the smallest possible failing example. There are methods to do this with custom strategies but I had a lot of trouble wrapping my brain around them. Can Hypothesis do this? If so, can you do a follow-up about how to shrink your own custom examples?

  • @shaiful7228
    @shaiful7228 2 ปีที่แล้ว +1

    Thank you for this informative video. Could you make a video to showcase property-based testing focused in Django API testing?

  • @VinnyMeller
    @VinnyMeller 2 ปีที่แล้ว

    every time i read about something cool i can count on you to make a video about it within the next few months instead of trying it myself xD
    this will be one i have to add into the mix

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว

      At your service 😁

  • @JoshPeak
    @JoshPeak 2 ปีที่แล้ว +4

    I’m sold on the idea, but translating it to a real project and not the toy examples is difficult. I think I need a real OSS code base that does it well to demonstrate non-trivial uses in like half a dozen use cases. As a data engineer, I can see the value of treating the inbound data as a random variable in tests but the cost to learn hypothesis always seems a little too steep to implement in a real world scenario.

    • @totalermist
      @totalermist 2 ปีที่แล้ว +1

      I'm on the fence about this, too. I very much like a data-driven approach to testing, as it allows you to collect real-world test cases and add failure cases over time without having to touch the actual test code. The library could be very useful for fuzzing, though, but I don't know whether *all* output of the given generator (e.g. "integers()") is actually tested.
      Would be quite irritating to have the test randomly fail if only a subset is drawn each time. OTOH the run time could be prohibitively long if the number of test cases becomes overwhelmingly large (think matrixes or lists).
      I guess the utility of the library depends heavily on the type of software you're working with.

    • @JoshPeak
      @JoshPeak 2 ปีที่แล้ว +1

      @@totalermist yeah I looked into ScalaCheck which is the equivalent in that ecosystem which is based off the QuickCheck in Haskell. Apparently the strategies run a limited sample of values but hold the seed. So if there is a failure it spits out the failing value and the RNG seed to the failure is repeatable.
      I think I heard in the ScalaCheck equivalent, that when it finds a failing case it tries to reduce it in some way to the simplest failing case. Eg if -1234 fails it’ll try -1. But I also heard the strategies can be biased / weighted to try common failure cases. Like strings() will try an empty string more often than even odds.
      But yeah… a walk through of a real OSS code base with the author of said code base is the master class I am after.

  • @federicomonegaglia3689
    @federicomonegaglia3689 2 ปีที่แล้ว +1

    Hi, I Always follow your videos and find them really useful. I would really like a video about event driven architecture showing both event and command handling, I wasn't able to find any good one of that around the youtube ether..

  • @loic1665
    @loic1665 2 ปีที่แล้ว

    Great video, as usual, thanks!
    This topic was new to me and I would be very interested in a second video on this subjet!
    Maybe as a suggestion: apply this to a more "real life" example? Given a small simple project, what parts would you want to test with normal unit tests, what parts would you want to cover with property-based tests, where to do both? What is really the added value of those extra property-based tests over the classic unit tests (including parametrized tests)? Maybe there are typical things that are very commonly tested with property-based tests? Also, I guess looking at the code coverage is not really the point... is it?
    It's just some thoughts :) I really liked the video, I see that this tool is extremely powerful, but I'm not sure how to apply this to my projects...

  • @robertbrummayer4908
    @robertbrummayer4908 2 ปีที่แล้ว

    Some pedantic comments ^^: test_negative_team_size should be called test_non_positive_team_size since 0 is typically considered neither positive nor negative. It is definitely not negative. I also propose to have a simple unit test for 0 since this is typically an interesting special case. If you just specify max_value to be 0 it is not guaranteed that 0 will be part of the generated random inputs.

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว +2

      Thanks for the feedback, Robert. Indeed, the naming you suggest is more precise. One way to add the 0 example is by using the @example decorator in Hypothesis and this will make sure Hypothesis includes it in the test data.

    • @robertbrummayer4908
      @robertbrummayer4908 2 ปีที่แล้ว

      @@ArjanCodes Hi Arjan! The example decorator is even better than my proposal. In this way one does not need an additional unit test. Awesome! Best wishes from Austria :)

  • @adarshbattu1568
    @adarshbattu1568 2 ปีที่แล้ว

    Arjan..
    I absolutely ❤️ your videos. You’ve rekindled my passion for Software development. I’ll definitely use it for QA activities.🔎
    BTW can you create a detailed video on code documentation 📑and cover tools like Pdoc3 Sphinx etc. that would be phenomenal

  • @miguelvasquez9849
    @miguelvasquez9849 2 ปีที่แล้ว

    Arjan, did you have a video explaining how to log? I have a problem with logging my program with multiprocessing and filerotating handlers. 😢

  • @ahasibrifat7568
    @ahasibrifat7568 2 ปีที่แล้ว

    Excellent!

  • @hudabdulwahab2499
    @hudabdulwahab2499 2 ปีที่แล้ว

    would love to see how this is used with dictionaries/pandas data!

  • @vulkanosaure
    @vulkanosaure ปีที่แล้ว

    How does hypothesis generate values.
    Since they can't try every possible values (too slow), does the library use a specific strategy to find corner cases ?

  • @edgeeffect
    @edgeeffect ปีที่แล้ว

    I don't get the Python highlighting in VSCode... how the settings decorator is green but all the others are yellow.

  • @dmytroparfeniuk2670
    @dmytroparfeniuk2670 2 ปีที่แล้ว

    Awesome! Thanks!

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว

      Happy you liked it!

  • @MMarcuzzo
    @MMarcuzzo 2 ปีที่แล้ว

    I've seen this hypothesis in some project before. This is the first video approaching this package, btw

  • @valeriusandof9782
    @valeriusandof9782 2 ปีที่แล้ว +1

    What do you think about using hypothesis for testing an API? Would generating payload data using hypothesis help discover any bugs?

    • @MMarcuzzo
      @MMarcuzzo 2 ปีที่แล้ว

      Why not?

    • @francescobartoli3222
      @francescobartoli3222 2 ปีที่แล้ว +1

      There is schemathesis for APIs

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว +2

      Sure. For example, if you have an API endpoint for searching documents with a skip and limit parameter, you could use Hypothesis to run tests with various values of skip and limit. Or test endpoints that expect dates with various date values, etc.

  • @Marco-sz6mq
    @Marco-sz6mq 2 ปีที่แล้ว

    Hi! Looking all your videos while I am in the train 😄 really great channel!
    I have a question: do you thing it’s a good practice to use something like ‘test()’ to generate values?
    I do not like this approach, since you don’t have the control of the inputs your are using to test your function.
    While writing I was thinking that maybe this type of testing can be part of the fuzzing testing that maybe can be separated from the unit one

  • @pouet4608
    @pouet4608 2 ปีที่แล้ว

    ok. it does not verify that the code matches specification, but it tracks bugs by matching invariants of function and reciprocical functions?

  • @kiosmallwood576
    @kiosmallwood576 2 ปีที่แล้ว

    Haven't watched it yet but, yes hypothesis is amazing

  • @AndreaGhensi
    @AndreaGhensi 2 ปีที่แล้ว

    Still not entirely sold on this.. the basic tests (encoding) pass also if I replace the functions with "return inp"... and even adding a type checking property test won't cut it either.. sure you don't care about what data to feed the function with, but you need to make sure that for a given input the function return the expected! Or am I missing something here?

  • @nnenad
    @nnenad 2 ปีที่แล้ว +2

    I saw a great way to make hypothesis tests cleaner. When importing you do:
    from hypothesis import strategies as some
    And then
    @given(some.text())

    • @trentsavage
      @trentsavage ปีที่แล้ว

      This is an awesome idea! I can't believe I haven't seen this before

    • @dispatch1347
      @dispatch1347 ปีที่แล้ว

      Stealing this, thank you!

  • @MultiJpva
    @MultiJpva 2 ปีที่แล้ว

    whats your favorite for software development ,windows ,linux or mac?

  • @yinmudino1
    @yinmudino1 ปีที่แล้ว

    Video is cool and enriching. But I have an error when trying to run the hypothesis through the generation. "ValueError: source code string cannot contain null bytes".

    • @yinmudino1
      @yinmudino1 ปีที่แล้ว

      I found the problem. When use the terminal to generate the hypothesis, it will create a file with format UTF 16. Need to change to UTF 8 to work

    • @yinmudino1
      @yinmudino1 ปีที่แล้ว

      You all know how to make the hypothesis generate the test file in UTF 8 instead of UTF 16 through pycharm. No matter how I change the file encoding in pycharm, it will always create the test file in UTF 16.

  • @DeathStocker
    @DeathStocker 2 ปีที่แล้ว

    Thanks a lot! Please cover `mutmut` next.

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว

      You’re welcome. And that’s also on the list. 😉

  • @TheTilpo
    @TheTilpo 2 ปีที่แล้ว

    hypothesis seems like a nice tool, will definitely try it in my next project.
    One thing I'm slightly confused about is that you suggest property-based testing as better than unit testing. I would call the kind of tests you wrote unit tests.
    Is unit testing really just tests of the form input -> expected output? Hard coding input/ouput examples sounds like a terrible way to write tests. Do people actually do that?

    • @trentsavage
      @trentsavage ปีที่แล้ว

      > unit testing really just tests of the form input -> expected output?
      I'm here to say, yes, that is what some, most, people think is unit testing. And even with property-based testing, you still have to write a good number of this variety of test. How else does the machine know what is "supposed" to happen, without re-writing all the code you are testing in the test itself?

  • @charismaticaazim
    @charismaticaazim 2 ปีที่แล้ว

    Does anyone know if hypothesis package is smart enough to avoid redundant tests OR does it always generate data randomly ? A redundant test has a implies relationship. For e.g: If a number is divisible by 4, then its obviously divisible by 2. Hence, i say the latter test is redundant.

  • @dmytroparfeniuk2670
    @dmytroparfeniuk2670 2 ปีที่แล้ว

    Let's imagine that we have a function than gets 2 arguments: string and integer and returns the boolean.
    For example, it returns True if the second argument (int) == len(first argument)
    Actually, I'd provide parametrized tests for the most common cases with the next structure:
    @parametrize("data, expected_result",
    [
    (("Arjan", 5), True),
    (("Hello", 3), False)
    ]
    )
    Could I do something similar with @composite for example?
    Or any usage with parametrized tests / fixtures / lazy_fixtures / ...
    Would be a great video, IMHO

  • @bwill325
    @bwill325 2 ปีที่แล้ว +1

    Is there an equivalent for JS?

    • @totalermist
      @totalermist 2 ปีที่แล้ว

      There is fast-check: th-cam.com/video/a2J_FSkxWKo/w-d-xo.html
      Also jsverify: dev.to/glebec/property-testing-with-jsverify-part-i-3one
      Both don't use decorators, though. Might be a fun project to port the hypothesis library to JS or TS while keeping the decorator approach.

  • @Klej0aka0Klej
    @Klej0aka0Klej 2 ปีที่แล้ว

    Could somebody tell me how should test for an example a function to calculate sum of squares would look like:
    I got a function:
    def my_func(a, b):
    return a**2 + b**2
    so now for tests
    @given(integers())
    def test_my_func(a, b):
    assert a**2 + b**2 == my_func(a, b)
    so my question is what's the point of a test like it or how to test math inside python? Because for me testing math looks like copying that func to test and checking if they are equal.
    Ofc I am not asking normal unit test:
    @pytest.fixture(a, b, result, [(1, 2, 5), (2, 3, 13)])
    def test_my_func(a, b, result):
    assert my_func(a, b) == result
    so If I think correctly hypothesis for testing math has no value right?

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว +4

      I agree that testing in this way doesn’t really help because a bug in the math is not found since it’s also in the testing code.
      What you could do instead is test for properties. For example, the sum of two squares is always a positive number or zero. So you could write a test that verifies that this property holds for a bunch of different a’s and b’s. I’m not a mathematician, but there are probably other properties of the sum of two squares that you could write tests for as well.

    • @Klej0aka0Klej
      @Klej0aka0Klej 2 ปีที่แล้ว

      @@ArjanCodes That totally makes sense :)

    • @aflous
      @aflous 2 ปีที่แล้ว

      I don't get the point behind testing plain arithmetic functions, you need to trust the software at least for this basic task 😌

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว +4

      @Aflous, I think this was intended as a simple example to illustrate testing mathematics functions. In a real software application, you probably wouldn’t have a function like this.

    • @dispatch1347
      @dispatch1347 ปีที่แล้ว

      @@ArjanCodes As a mathematician, another one is using the fact that (a - b)^2 >= 0, so a^2 - 2ab + b^2 >= 0, so a^2 + b^2 >= 2ab. But also the opposite, (a + b)^2 >= 0 so a^2 + b^2 >= -2ab, which helps if one of a or b is negative.
      Another idea is testing the sum of two integers is also an integer.
      Of course we could think about extending to floats, complex numbers, lists of a,b pairs, etc. and you would need to check for different properties.

  • @kayakMike1000
    @kayakMike1000 2 ปีที่แล้ว

    Shouldn't you write tests before you code? Or is that only TDD?

    • @totalermist
      @totalermist 2 ปีที่แล้ว

      That's only TDD in its strictest form. In practice, though, I've never seen it applied consistently.
      Property-based testing is more of an in-between of unit tests (e.g. what TDD is based on), BDD, and integration testing.
      If you only test single code units or functions, it's more towards the UT side, if you use it to test against specs (given-when-then), it's more BDD, etc.

  • @krissn8111
    @krissn8111 2 ปีที่แล้ว

    Quite interesting

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว

      Thanks Kriss, happy you’re enjoying the content!

  • @thepaulcraft957
    @thepaulcraft957 2 ปีที่แล้ว +1

    i gave u a like bello 😂

  • @terryhobart
    @terryhobart 2 ปีที่แล้ว

    I put my email in the code guide site box several times. Never came. Not in spam.

    • @ArjanCodes
      @ArjanCodes  2 ปีที่แล้ว

      If you send me an email (support@arjancodes.com), I’ll make sure you get the guide.

  • @vampir753
    @vampir753 8 หลายเดือนก่อน

    Elon Musk explaining his Code after acquiring Twitter: 8:20

  • @mikedeboston1023
    @mikedeboston1023 2 ปีที่แล้ว

    Tangent: You've nailed the lighting, but the background sound is a little too loud and distracting, at least to my ears.

  • @jkrigelman
    @jkrigelman 2 ปีที่แล้ว +2

    So this is pythons fuzzer.

    • @fukawitribe
      @fukawitribe 2 ปีที่แล้ว +1

      It is _a_ Python fuzzer, yes.

  • @pablodm9
    @pablodm9 2 ปีที่แล้ว +1

    7:41 don't do that again pls

  • @MrGeordiejon
    @MrGeordiejon 2 หลายเดือนก่อน

    👍 🐕