Kaiwen Wang
Kaiwen Wang
  • 4
  • 58 876
Conditional Language Policy for Steerable Alignment
📝 "Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning", to appear at EMNLP 2024.
🔗 Paper link: arxiv.org/abs/2407.15762
Abstract: Reward-based finetuning is crucial for aligning language policies with intended behaviors (e.g., creativity and safety). A key challenge is to develop steerable language models that trade-off multiple (conflicting) objectives in a flexible and efficient manner. This paper presents Conditional Language Policy (CLP), a general framework for finetuning language models on multiple objectives. Building on techniques from multi-task training and parameter-efficient finetuning, CLP learn steerable models that effectively trade-off conflicting objectives at inference time. Notably, this does not require training or maintaining multiple models to achieve different trade-offs between the objectives. Through extensive experiments and ablations on two summarization datasets, we show that CLP learns steerable language models that outperform and Pareto-dominate the existing approaches for multi-objective finetuning.
Keywords: Reinforcement Learning, Multi-Objective Finetuning, Multi-task Learning, Parameter Efficient Training
มุมมอง: 40

วีดีโอ

Day of a Computer Science PhD at Cornell
มุมมอง 46K2 ปีที่แล้ว
A day in the life of a Computer Science PhD at Cornell University (Ithaca Campus). Music Adventures - A Himitsu Destination - MBB Sparks - Chaël Rainbow - JayJen
Construction of suffix arrays
มุมมอง 12K5 ปีที่แล้ว
A tutorial on the efficient construction of suffix arrays in O(n log(n))
Mazify! CMU 15-112 Term Project
มุมมอง 1.1K7 ปีที่แล้ว
Mazify! 15-112 Term Project for Fall 2016 at Carnegie Mellon University. Written in Python3, with Tkinter and Pillow, using techniques such as recursive backtracking, k-means segmentation and graph theory. Github: github.com/kaiwenw/Mazify-

ความคิดเห็น

  • @Kondlapudi
    @Kondlapudi 2 วันที่ผ่านมา

    trollolololololololololololol

  • @ankursingh8296
    @ankursingh8296 3 หลายเดือนก่อน

    Can a Cornell phd econometrics student take classes in ml, 3 to 4 courses in CS department?

  • @engineerquratulain1034
    @engineerquratulain1034 8 หลายเดือนก่อน

    Hello can you help to write thesis in machine language?

  • @xenonmob
    @xenonmob 10 หลายเดือนก่อน

    thank you for using upbeat music instead of wrist-slitting depressing lo fi !

  • @plugplay5063
    @plugplay5063 ปีที่แล้ว

    unsupervised😂😂 ur a ML STUDENT BRO

  • @codingwithifeanyi5379
    @codingwithifeanyi5379 ปีที่แล้ว

    how old are u please?

  • @amorstodomingo2798
    @amorstodomingo2798 2 ปีที่แล้ว

    Just subscribed. Looking at Cornell’s program. Thanks for sharing

  • @AsrifYusoff
    @AsrifYusoff 2 ปีที่แล้ว

    Great content and subscribed! Would love hear what you think of our grad school channel. Thank you.

  • @tcantr
    @tcantr 2 ปีที่แล้ว

    Great explanation of prefix doubling. Thanks for this very helpful video 🙏

  • @beatrizcarmona9051
    @beatrizcarmona9051 2 ปีที่แล้ว

    Fellow first-year Cornell PhD student here, not sure why i'm watching this bc my life is pretty much exactly the same as what's in this vlog hahaha. great vid tho!

  • @JamesBrodski
    @JamesBrodski 2 ปีที่แล้ว

    Thank you for making this video! So cool!

  • @hileamlakmulugeta5959
    @hileamlakmulugeta5959 2 ปีที่แล้ว

    Keep it up!

  • @vyem7283
    @vyem7283 2 ปีที่แล้ว

    How old are you?

  • @ralph6819
    @ralph6819 2 ปีที่แล้ว

    you need to tamp your espresso

  • @chandrachudpati902
    @chandrachudpati902 2 ปีที่แล้ว

    Only south asian profs

  • @helloworld4788
    @helloworld4788 2 ปีที่แล้ว

    Where did you get your undergraduate degree

  • @gabiedubin
    @gabiedubin 2 ปีที่แล้ว

    first year cs student at the Hebrew university of jerusalem and i gotta say , doing a PHD in computer science sounds pretty hellish to me right now

  • @starlight7499
    @starlight7499 2 ปีที่แล้ว

    CS is not easy but very in demand , rewarding and pays very well. Keep up the good work 👌 .

    • @Alexandre-xz6jm
      @Alexandre-xz6jm 2 ปีที่แล้ว

      Lol he starts at 10h30 and goes to sleep before midnight, what’s hard about that

    • @stefanfaulkner5169
      @stefanfaulkner5169 2 ปีที่แล้ว

      ​@@Alexandre-xz6jm Smh, let's see you go ahead and do a Ph.D. CS Program at an Ivy League university since it's not hard............ Assuming you could even be accepted in the first place.

    • @gaoalexander73
      @gaoalexander73 2 ปีที่แล้ว

      @@Alexandre-xz6jm "Computer science isn't hard compared to majors like engineering" , uhh no it is actually. You should stop

  • @MrVontar
    @MrVontar 2 ปีที่แล้ว

    Nice, I am doing software engineering although I kinda hate academia in general

  • @isaacstamper7798
    @isaacstamper7798 2 ปีที่แล้ว

    Bro that book is extremely cringe. Kissinger is one of the most evil people in modern history

  • @isaacstamper7798
    @isaacstamper7798 2 ปีที่แล้ว

    Gonna be starting a PhD in stats at Binghamton university next year. Cool to see what the lifestyle is like for other students

  • @zigakovacic4818
    @zigakovacic4818 2 ปีที่แล้ว

    A great video! I'm also considering a PhD (not sure if at Cornell) after I finish my studies at Cornell!

    • @SamElle
      @SamElle 2 ปีที่แล้ว

      I’m a phd student at Cornell!!! I also did undergrad there

  • @owen246
    @owen246 2 ปีที่แล้ว

    Hi Kaiwen, Currently physics phd student at Brown looking at theoretical deep learning/stat mech. Would love to hear more about your story leaving facebook/employment/money to pursue a PhD!

    • @GarrettLawrence
      @GarrettLawrence 2 ปีที่แล้ว

      Does Brown have good engineering programs? How would you compare it to a state school like Michigan, UNC or UF?

    • @brandonmckinzie2737
      @brandonmckinzie2737 2 ปีที่แล้ว

      Same here. In a similar position doing ML at MANGA considering a PhD. Would love to hear more details on this!

    • @owen246
      @owen246 2 ปีที่แล้ว

      @@GarrettLawrence Although I've only been here for grad school, from what I can see it's very undergraduate focused. Admittedly the resources at Brown's disposal in the Engineering department don't compare to Michigan, UNC, etc., but the education that you'll receive at either school should be fine. However, Brown is really really known for their CS department and a lot of students go into software engineering at FANG or go into finance (I've already been recruited for quant stuff). I assume this isn't the case at schools like Michigan, Georgia Tech, or similar institutes, but again I can't attest to their programs since I don't attend these schools. Word of advice: don't take out debt if you can, most of the time a rank 50 school on a full-ride will be more worth it than a rank 30 school you pay for, but again, it is much more nuanced than this.

  • @dacioferreira7127
    @dacioferreira7127 2 ปีที่แล้ว

    Awesome. More video like this.

  • @rohitgampa7972
    @rohitgampa7972 2 ปีที่แล้ว

    Cool

  • @letechnicaljames
    @letechnicaljames 2 ปีที่แล้ว

    Great video! Had to sub! :)

  • @onur6260
    @onur6260 2 ปีที่แล้ว

    Very inspiring vlog :)

  • @usmleck7000
    @usmleck7000 2 ปีที่แล้ว

    Keep it up 👍

  • @lu-codes3737
    @lu-codes3737 2 ปีที่แล้ว

    very cool!

  • @Mahhawn
    @Mahhawn 2 ปีที่แล้ว

    loved it, keep going

  • @leojin5151
    @leojin5151 2 ปีที่แล้ว

    Great VLOG! Please keep updating!

  • @lysatolentino5652
    @lysatolentino5652 2 ปีที่แล้ว

    How to survive PhD class! hehehe good luck :)

  • @shiftingleaf8264
    @shiftingleaf8264 2 ปีที่แล้ว

    Your vibe is so comforting I love watching and you should record the yoga too!

  • @abdulkadirjamia9431
    @abdulkadirjamia9431 2 ปีที่แล้ว

    I really like it 👍

  • @alihussain2588
    @alihussain2588 3 ปีที่แล้ว

    Hey! I loved your vlog on what it would be like to be researching in order to become a PhD at Cornell. I actually applied to Cornell so fingers crossed I get in. Currently I'm a senior in highschool and looking forward to doing Computer Science in college as my major; I have some experience with coding but would really like to become more advanced and eventually work with algorithms and datasets, etc. Do you have any suggestions for research that I should be doing, texts I should be reading, or ways that I can be getting ahead of the game and making myself the most advanced that I possibly can be at this stage? I really look forward to hearing your wisdom. And if it is possible I would love to maintain some sort of contact (even though I understand this is a bit odd of a request) so that I may ask anymore questions--if you're willing to answer them of course. Amazing vlog, thank you so much for sharing. I would love to see more!

  • @williamqian229
    @williamqian229 3 ปีที่แล้ว

    wow! awesome video!

  • @JennyJee
    @JennyJee 3 ปีที่แล้ว

    Awesome vlog! Love this! :D

  • @terezamiklosova104
    @terezamiklosova104 3 ปีที่แล้ว

    Amazingly clear and explanative video. This was exactly what I've been looking for. Thank you, it helped me a LOT!

  • @cccccc864
    @cccccc864 4 ปีที่แล้ว

    Thanks dude.

  • @yusufahmed2233
    @yusufahmed2233 4 ปีที่แล้ว

    Bro... Thank you 😭😭😍😍

  • @thunder7753
    @thunder7753 4 ปีที่แล้ว

    Hey can u please explain it using strings instead of numbers and making tables for each state, i would be very grateful for that

  • @jaimeeduardo159
    @jaimeeduardo159 4 ปีที่แล้ว

    Excellent video

  • @mezhoudabdelmoumen8026
    @mezhoudabdelmoumen8026 4 ปีที่แล้ว

    is that the manber and myers's algorithm ?

  • @KuldipGhotane
    @KuldipGhotane 4 ปีที่แล้ว

    thanks for making this video.

    • @kaiwenw
      @kaiwenw 4 ปีที่แล้ว

      My pleasure!

    • @KuldipGhotane
      @KuldipGhotane 4 ปีที่แล้ว

      @@kaiwenw can you create a video for suffix array as there is little information about them on youtube

  • @hackerhaddi
    @hackerhaddi 4 ปีที่แล้ว

    anyone noticed his adam's apple lol, great video though

  • @mondal1839
    @mondal1839 4 ปีที่แล้ว

    Could you please provide the code in c++?

  • @TheComsVerge
    @TheComsVerge 4 ปีที่แล้ว

    I want to hit like multiple😍😍 times but youtube does not allow this. Thanks for such a great video. 😃😃

    • @kaiwenw
      @kaiwenw 4 ปีที่แล้ว

      Thank you so much 😀

  • @Mnnvint
    @Mnnvint 4 ปีที่แล้ว

    This was a great explanation of the "prefix doubling" approach of constructing suffix arrays! You really helped me get this, and (fingers crossed) getting one step closer to understanding the linear time suffix array construction algorithms.

  • @doge-coin
    @doge-coin 4 ปีที่แล้ว

    Thanks for uploading this video. This is very clear and helpful. :D

  • @jaguar1101
    @jaguar1101 5 ปีที่แล้ว

    Been looking all over the internet for an explanation like this. Thank you for a great and informative video. If you make more similar videos I'd definitely watch them.

    • @kaiwenw
      @kaiwenw 5 ปีที่แล้ว

      Thanks!! What other content do you want to see?

    • @jaguar1101
      @jaguar1101 5 ปีที่แล้ว

      More useful data structures would be nice, perhaps lcp array as a follow up to this video for example.