Announcing Snorkel Flow!
We made an exciting announcement on snorkel.org today! We've reposted that message below: The Snorkel team is now focusing their efforts on Snorkel Flow, an end-to-end AI application development platform based on the core ideas behind Snorkel—check it out at snorkel.ai! The…
Struggle to put all the pieces together in the snorkel pipeline
Hello! I’m using Snorkel in my master’s thesis on textual data to categorize comments into a set of different categories. I have gone through the tutorials on the Snorkel tutorial GitHub page but still struggle…
How to use Snorkel on numeric data?
Hi - I'm trying to see if I can use Snorkel to label my numeric training dataset (and then looking to either do label classification or convert the labels to scores and do a regression classification). My question is if there are coding examples anywhere that show how to use…
Widely varying results when training multi-task model on same inputs
Hello! I'm in the process of writing a paper comparing the efficacy of multi-task models with weak supervision to the performance of their single-task equivalents, and I've been using Snorkel v0.9.6 to explore this. However, I've been experiencing a strange issue in that when…
Why are the probabilities not higher in the positive label group?
Hi, I used LabelModel to label my train set, and I also obtained probabilities by using label_model.predict(L=L_train, tie_break_policy="abstain", return_probs=True). I assume that the probabilities should be relatively higher in the positive label group. However, that is not what…
Is it possible to assign weights to labeling functions?
Hi, dear Snorkel community, I am relatively new to using Snorkel. My purpose is to use Snorkel to generate labels for a dataset we got. So far, we came up with dozens of rules, which are mainly based on domain knowledge of the data. Domain knowledge gave us hints that these…
Snorkel for Time Series Annotation
Hi everyone :) I was diving into Snorkel methods, namely labeling functions and slicing functions (and using LabelModel) for labeling time series of IMU data (much like the paper that got posted here, https://openreview.net/pdf?id=SJedYj5ruV), and I managed to get some labeling…
What is the best redundant labeling strategy to maximize the performance of…
I have 50,000 sentences I need labeled for a custom NER model, with triple redundancy. I can have as many workers as I want do the labeling. I will combine their labels using Snorkel's LabelModel. What is the best strategy to employ to give the LabelModel the best performance…