menu

snorkel

Building and managing training datasets for machine learning

Channels
# All channels
view-forward
# announcements
view-forward
# api
view-forward
# applications
view-forward
# help
view-forward
# projects
view-forward
# tutorials
view-forward
Team
Posts
Members
Info

How to use Snorkel on numeric data?

Hi - I'm trying to see if I can use Snorkel to label my numeric training dataset (and then looking to either do label classification or convert the labels to scores and do a regression classification). My question is if there are coding examples anywhere that show how to use…

thumbsup
0
message-simple
1

Widely varying results when training multi-task model on same inputs

Hello! I'm in the process of writing a paper comparing the efficacy of multi-task models with weak supervision to the performance of their single-task equivalents, and I've been using Snorkel v0.9.6 to explore this. However, I've been experiencing a strange issue in that when…

thumbsup
0
message-simple
2

How to import the GenerativeModel?

I am following this tutorial(https://www.youtube.com/watch?v=2bhBe9HGuSQ) by @jfries for training a generative model, in the notebook shown in video, the following import is used from snorkel.learning import GenerativeModel, but when I execute the following command, I get the…

thumbsup
0
message-simple
13

Is it possible to assign weights on label functions

Hi, dear Snorkel community, I am relatively new to using Snorkel. My purpose is to use Snorkel to generate labels for a dataset we got. So far, we came up with dozens of rules, which are mainly based on domain knowledge of the data. Domain knowledge gave us hints that these…

thumbsup
0
message-simple
3

Why the probabilities are not higher in the positive label group?

Hi, I used LabelModel to label my train set, and I also obtained probability by using label_model.predict(L=L_train, tie_break_policy="abstain", return_probs=True). I assume that the probabilities should be relatively higher in the positive label group. However, that is not what…

thumbsup
0
message-simple
4

Snorkel for Time Series Annotation

Hi everyone :) I was diving into snorkel methods, namely labeling functions and slicing functions (and using LabelModel) for labeling time series of IMU data (muck like the paper that got posted here, https://openreview.net/pdf?id=SJedYj5ruV), and I managed to get some labeling…

thumbsup
0
message-simple
1

How to use the mdoel probabilities output

I'm trying to use Snorkel for labeling social media text as topics. In the tutorial it says "As we saw in Section 4, the LabelModel outputs probabilistic (float) labels. If the classifier we are training accepts target labels as floats, we can train on these labels directly" …

thumbsup
0
message-simple
1

Training with ground-truth labeled and Snorkel-labeled data

Background: I'm writing a conference paper on some Snorkel work, so I want to make sure I understand why Snorkel is considered a weak supervision approach, but isn't considered a semi-supervised learning approach, even though many tutorials leverage some ground truth data in…

thumbsup
1
message-simple
2

Shortlisting large number of labelling functions

I am trying to use snorkel in the context of credit risk modelling (binary classification). I have written lots of labeling functions. I do have some ground truth available so I want to inform labeling functions somehow. The question is: which ground-truth related metrics is it…

thumbsup
0
message-simple
1

Snorkel for unbalanced datasets

Hi, I am new to Snorkel and facing some issues. I have to classify the dataset into two classes. The dataset is imbalanced(25 positive for 300 negatives). The labeling functions used also generate an imbalanced dataset as expected. After using Snorkel, all the labels are…

thumbsup
0
message-simple
11