- 339
- 335 775
MLconf
United States
เข้าร่วมเมื่อ 21 ก.ค. 2013
MLconf was created to host the thought leaders in Machine Learning and Data Science to discuss their most recent experience with applying techniques, tools, algorithms and methodologies to the seemingly impossible problems that occur when dealing with massive and noisy data. MLconf is independent of any outside company or university - it’s simply an organization who gathers the Machine Learning communities in various ways to share their lessons learned and create an environment for the community to coalesce.
MLconf SF 2022: Two competing Applications of Non-parametric Inference, Matthew Schreiner @airbnb
At Airbnb, we quantify the performance of a given page via the Page Performance Score (PPS), a holistic measure of latency that incorporates several signals of user-perceived page performance, from rendering time to interactivity delays. As the event distribution for a given PPS subcomponent is not guaranteed to be gaussian, neither is the PPS distribution. This presents problems for classic parametric inference approaches in an experimentation context (eg: when asking “is ∆PPS = PPS_TREATMENT - PPS_CONTROL significant?”) In an experiment randomized at the user-level, we can compute the p-value associated with ∆PPS by implementing a cluster/block bootstrapping approach to ensure proper partitioning of events. This follows from a “Strict” interpretation of the Null Hypothesis; that users in Control are not fundamentally different from those in Treatment. However, if we revisit the underlying assumption of the Null Hypothesis (users in the two groups are not fundamentally different), this should imply that the distribution of events generated by the blended users are not fundamentally different and we can therefore blend all the events and resample at the performance-event level. This “Lazy” interpretation of the Null Hypothesis ignores the latent structure of the input data and can amplify the False Negative Rate by widening the Null Distribution, but this injection of symmetric noise ultimately generates more conservative p-values at the time of inference. Depending on the business context: (1) a desire for absolute certainty in identifying significant PPS improvement or regression or (2) a fast/directional read on the practical impact of a PPS shift either the “Strict” or “Lazy” bootstrapping algorithm can be applied. In practice, the “Lazy” approach to bootstrapping results in substantial speedup, reducing compute resources and runtime by ≈1.2x, with minimal loss in statistical Power compared to the “Strict” method.
มุมมอง: 47
วีดีโอ
MLconf SF 2022: Semantic Label Representation and Multimodal Categorization, Binwei Yang @Walmart
มุมมอง 327 หลายเดือนก่อน
At the core of any e-commerce product catalog is product categorization. Accurate product categorization not only has an impact on revenue growth but is also a key for good customer experience. However, research has shown that even though the mistakes made by machine learning classification model have been dramatically reduced, the severity of mistakes has not changed much. When model mistakes ...
MLconf SF 2022: Scaling up Lyft’s Growth via Causal ML and Bandits by Lei Tang @lyft
มุมมอง 1417 หลายเดือนก่อน
Machine learning, causal inference, and reinforcement learning have played a key role in scaling up and personalizing Lyft decisions for growth, ranging from an ad exposure, a landing page for user onboarding, an email to be sent to a user, and a coupon/bonus to be dropped to rider/driver. In this talk, we’ll showcase a suite of growth engines powering Lyft’s rider, driver, and business growth,...
MLconf SF 2022: Building Multi-Tenant Compute Systems in the Enterprise, Zachary Hanif @CapitalOne
มุมมอง 887 หลายเดือนก่อน
Capital One’s investments in building out a large engineering organization, fully moving to the cloud, re-architecting our applications and data platforms, and embracing machine learning at scale have made us a pioneer in the ability to scale ML across the enterprise in our industry. At a large financial services institution like ours, multi-tenant software architecture for machine learning at ...
MLconf SF 2022: Few-Shot Learning in Conversational AI by Mahnoosh Mehrabani @interactionsco
มุมมอง 417 หลายเดือนก่อน
When you yell “representative” at a customer service line and get directed to a live agent, you probably have natural language understanding (NLU) to thank. NLU is a crucial piece of conversational artificial intelligence (AI) that transforms human language-whether it be text or spoken-into digestible semantic information for machine comprehension. Interactions, a leading provider of Intelligen...
MLconf SF 2022: Balancing Thin Line Between Data Intelligence & Privacy, Dr. Sherin Mathews @usbank
มุมมอง 327 หลายเดือนก่อน
Federated learning holds great promise in learning from fragmented sensitive data and has revolutionized how machine learning models are trained. This article provides a systematic overview and detailed taxonomy of federated learning. We investigate the existing security challenges in federated learning and provide a comprehensive overview of established defense techniques for data poisoning, i...
MLconf SF 2022: Humans helping machines help humans run machines by Leah Mcguire @Benchling
มุมมอง 307 หลายเดือนก่อน
In this talk I will describe how having specialists inject domain knowledge about the problem being modeled, combined with automation of modeling steps, can produce good quality models across highly varied small datasets. There is a large amount of mission critical, small, tabular data produced across many domains. There are not enough specialists to analyze this data, it is too small and too v...
MLconf SF 2022: Essential Ingredients in Scaling Organizations for ML by Dr. Ali Arsanjani @Google
มุมมอง 1007 หลายเดือนก่อน
How do you scale your skills for end to end machine learning across the life-cycle. What are the key components of a machine learning lifecycle and how can you stitch them together, in what combinations for which use cases, in order to get to the next level of actualization of the ML journey that organizations are embarking upon so they can capitalize on the growth and competitive advantage tha...
MLconf SF 2022: Empowering Traceable & Auditable ML in Production w/Hendrix, Jonathan Jin @Spotify
มุมมอง 1.1K7 หลายเดือนก่อน
Spotify has incorporated machine learning into all corners of the product, ranging from front-and-center features like Discover Weekly, to less obvious, more “hidden” use cases like playlist recommendations and natural-language search. Despite their power and potential, however, machine learning models have earned a reputation for being inscrutable and unexplainable “black boxes.” These effects...
MLconf SF 2022: Driving Autonomous Vehicles Forward w/Real World Applications, Vinutha Kallem @Waymo
มุมมอง 557 หลายเดือนก่อน
The autonomous vehicle industry has made significant progress in recent years, with the launch of fully autonomous ride-hailing services, commercialized autonomous trucking solutions, and local delivery logistics. However, the dynamism of real-world roads presents a set of complex robotics challenges to autonomous driving in urban environments, including busy intersections, narrow streets, ever...
MLconf SF 2022: Industrial Applications of Machine Learning in Search by Jay Wang @MicrosoftKuaishou
มุมมอง 627 หลายเดือนก่อน
Over the last 20 years we have witnessed the applications of machine learning in search, recommendations and advertising. The state of the art machine learning systems are large-scale, near-real-time, multi-modal and intelligent in nature. In this presentation, I’ll focus on a few most recent advances, including faster retrieval, large-scale ranking models, advances in reranking, and experiment...
MLconf SF 2022: A Generative AutoML for Tabular Data by Dr. Mehdi Bahrami @Fujitsu
มุมมอง 597 หลายเดือนก่อน
An Automated-Machine Learning (AutoML) platform aims to automate the process of data engineering, feature engineering, hyper-parameter optimization, training, prediction, and deployment of a model, where it minimizes human supervision in all stages. One of the popular aspects of artificial intelligence utilization across different domains is AutoML for tabular data (structured data). In this ta...
MLconf SF 2022: AI Factsheets as Industry Standard by Armand Ruiz @IBM
มุมมอง 1287 หลายเดือนก่อน
AI Fact Sheets are a lot like packaged food nutrition labels. They contain information about an AI model’s development, capabilities, benchmark performance, and more. In this session, I would like to share how to create transparency and info reports. The tool would generate most, if not all, of the AI Fact Sheet’s information automatically. AI models and services are used in a growing number of...
MLconf SF 2022: Using Deep Learning to Understand Documents by Eitan Anzenberg @Bill_hq (Bill.com)
มุมมอง 667 หลายเดือนก่อน
Extracting key-fields from a variety of document types remains a challenging problem. Services such as AWS, Google Cloud and open-source alternatives provide text extraction to “digitize” images or pdfs to return phrases, words and characters. Processing these outputs is unscalable and error-prone as varied documents require different heuristics, rules or models and new types are uploaded daily...
MLconf NYC 2022: Building a Modern, Datacentric Tech Stack by Davit Buniatyan @activeloop
มุมมอง 597 หลายเดือนก่อน
Over the past 40 years, databases have evolved multiple times to work well for structured data. With the growth of computer vision data, we need solutions specifically optimized for these use cases. In this talk, Davit Buniatyan presents the Database for AI, a data-centric framework resolving common AI data bottlenecks. Learn how by being data-centric, you can (1) achieve up to 95% GPU utilizat...
MLconf NYC 2023: Navigating the Landscape of Bias in Recommender Systems by Amey Dharwadker @meta
มุมมอง 527 หลายเดือนก่อน
MLconf NYC 2023: Navigating the Landscape of Bias in Recommender Systems by Amey Dharwadker @meta
MLconf NYC 2022: Deployment & Workflow Integration to Predict Adverse Events by Yin Aphinyanaphongs
มุมมอง 247 หลายเดือนก่อน
MLconf NYC 2022: Deployment & Workflow Integration to Predict Adverse Events by Yin Aphinyanaphongs
MLconf Online 2021: An end to end ML Platform for Product Decisions by Igor Markov of Facebook
มุมมอง 1257 หลายเดือนก่อน
MLconf Online 2021: An end to end ML Platform for Product Decisions by Igor Markov of Facebook
MLconf Online 2021: The Enterprise Neurosystem by Bill Wright of Red Hat and Ryan Coffee of SLAC
มุมมอง 5247 หลายเดือนก่อน
MLconf Online 2021: The Enterprise Neurosystem by Bill Wright of Red Hat and Ryan Coffee of SLAC
MLconf Online 2021: Declarative AI for the Modern Data Stack by Jordan Volz of Continual
มุมมอง 407 หลายเดือนก่อน
MLconf Online 2021: Declarative AI for the Modern Data Stack by Jordan Volz of Continual
MLconf NYC 2022: Expectations vs. Reality Machine Learning in Digital Health Kerry Weinberg
มุมมอง 527 หลายเดือนก่อน
MLconf NYC 2022: Expectations vs. Reality Machine Learning in Digital Health Kerry Weinberg
MLconf NYC 2022: Machine Learning for the Greater Good by Sherard Griffin, Marius Bogoevici, Red Hat
มุมมอง 407 หลายเดือนก่อน
MLconf NYC 2022: Machine Learning for the Greater Good by Sherard Griffin, Marius Bogoevici, Red Hat
MLconf NYC 2022: A Unique Approach to Discover Synthetic Identity Fraud by Cori Shen of Equifax
มุมมอง 1297 หลายเดือนก่อน
MLconf NYC 2022: A Unique Approach to Discover Synthetic Identity Fraud by Cori Shen of Equifax
MLconf NYC 2022: Building a Continuous Representation of Atomic Environment, Olga Kononova, Roivant
มุมมอง 207 หลายเดือนก่อน
MLconf NYC 2022: Building a Continuous Representation of Atomic Environment, Olga Kononova, Roivant
MLconf NYC 2022: Event Driven Machine Learning at Scale by Timothy Spann of StreamNative
มุมมอง 567 หลายเดือนก่อน
MLconf NYC 2022: Event Driven Machine Learning at Scale by Timothy Spann of StreamNative
MLconf NYC 2022: Hope and Failure for ML in Healthcare by Srinivas Sridhara of Optum Labs
มุมมอง 637 หลายเดือนก่อน
MLconf NYC 2022: Hope and Failure for ML in Healthcare by Srinivas Sridhara of Optum Labs
MLconf NYC 2022: Leveraging Text Mining to Extract Insights by Yuyu Fan of AllianceBernstein
มุมมอง 1497 หลายเดือนก่อน
MLconf NYC 2022: Leveraging Text Mining to Extract Insights by Yuyu Fan of AllianceBernstein
MLconf NYC 2022: How to Detect and Interpret Data Drift in Production by Emeli Dral of Evidently AI
มุมมอง 507 หลายเดือนก่อน
MLconf NYC 2022: How to Detect and Interpret Data Drift in Production by Emeli Dral of Evidently AI
MLconf NYC 2022: Model Invariants and Functional Regularization by Dr. Harvey J. Stein of Two Sigma
มุมมอง 287 หลายเดือนก่อน
MLconf NYC 2022: Model Invariants and Functional Regularization by Dr. Harvey J. Stein of Two Sigma
MLconf NYC 2022: Best Practices for a Scalable Enterprise ML Foundation by Abhijit Bose, Capital One
มุมมอง 697 หลายเดือนก่อน
MLconf NYC 2022: Best Practices for a Scalable Enterprise ML Foundation by Abhijit Bose, Capital One
Very useful talk. John is great
Has a functioning tool come out of it that's available somewhere? that can extract road edges/curbs?
the cure for cancer is: what source of energy can all cells survive on except cancer? Answer; Ketones. Solution: starve the cancer by depleting glucose via a low carb high fat diet. Many people have now cured their own cancer through diet alone.
Very well spek
This is mathematically so simple but yet so profound. Absolutely great!
It is not mathematically simple.
Is there any link to the example that was demonstrated?
great discussion
Can't the GMM be solved using SVM? What's the difference : separability vs higher order relationships?
Wow finally someone who understands this problem
David Brin predicted the AI empathy crisis with Google's LAMBDA.
Awesome talk - thanks
what happened to fakerfact ?
This is really well presented and provides high quality useful information on applications of AI.
Thanks for the video.
very interesting and informative presentation thanks
Annoying to hear this talk. Come on man. This is not race. There is no point in packing everything in 20 mins if the audience not going to understand it. Quality is important than quantity.
@9:20 Speaker says "I would probably not like the colleague's earthquake post" and I see Mark Zuckerburg liked that post. lol
Leslie Smith is a great speaker in addition to having contributed significantly to deep learning
where do you get the dataset please share the link
a 2% performance boost after all that?
when the scale of the problem is so large, even 1% improvement can have a large amount of impact.
Very good
Great talk!
FAIL!
This has some great insights and learnings. Thanks 🙏🏻☺️
Amazing work and interesting use of embeddings, with a strong privacy as profiles are not part of the training data. Thanks for sharing ! I'd be curious to know what is the current state of Tinder recommendation algorithm.
Thanks for such an insightful talk, par excellence
nice
awesome. he also gave a great talk on reinforcement learning, which I found on TH-cam.
my self prefer tensorflow
Thanks for this awesome pres !
i just cash out 15,000 USD paypal hack transfer from VASTLINKSHACK>COM guys their service are extremely awesome!!
Egonet
Although it was done in 2016, but still found useful nowadays
Interesting Idea of fairness optimization not much proof it works, also no explanation how optimally groups are formed that too involves fairness questions i.e formation of groups.
I think by groups in context of Spotify they mean artists, although I could imagine there being multiple level of groups, like genre groups and language groups
The point of feature engineering for machine learning, like classification, is a quite limited issue compared to the things the brain does. So it does not really make sense to oppose generic learning models to single purpose ML tasks. That's the whole point why we look at brains. Since nobody rejects that data-driven and prior must be combined, the question is *which* priors (or innate knowledge) and not *if*.
I'd love to know what Evan said while the audio cut out.
Very interesting and necessary work! Thanks for doing that
what happens when the funciton that we sample ist of stochastic instead of deterministic nature? would gaussian processes( or another surrogate model) and thus bayesian optimization have a way to deal with that? EDIT: im referring to what he scott states at around 13:10
pure gold
Presentation slides: www.slideshare.net/SessionsEvents/3-ewa-dominowska-managing-machine-learning-projects-in-industry
Hey @MLconf, shouldn't you mention the title of the paper in the title rather than where the prof works? Titles can be edited, so it applies to the past uploads as well.
I had fun while learning about learning rate scheduling in this talk. Nice talk!
2:20 cracked me out - we all have been there
Results in 18:55
Very useful lecture, thanks for uploading such videos
@MLconf is the airbnb talk going to be shared ?
is the airbnb talk going to be shared ?
How to predict multiple replies from the single question?
when will we se the Uberstock below 25% . Race to the bottom.
ok, just seeing this now and there are likely even better videos now available although this one is terrific. One nit.... can you give the cost function in python. i don't see the code for that.