menu
announcement

Spectrum is now read-only. Learn more about the decision in our official announcement.

snorkel

Building and managing training datasets for machine learning

Channels
Team
Posts
Members
Info
down-caret

Announcing Snorkel Flow!

We made an exciting announcement on snorkel.org today! We've reposted that message below: The Snorkel team is now focusing their efforts on Snorkel Flow, an end-to-end AI application development platform based on the core ideas behind Snorkel—check it out at snorkel.ai! The…

message-simple
6

Feedback on how to further improve a data labeling model using snorkel

Hi, I am working on a project to programmatically label messages as spam or ham (not spam) on this sms dataset (https://paperswithcode.com/dataset/sms-spam-collection-data-set). By referring to the snorkel github tutorial (which I found really helpful), I have been able to…

message-simple
0
rohi0007
·79d
# help

Trying to create dataset for creating a model for visual relationship Detection

Hi All, I am trying to solve use case for which we have to create a model but whatever VRD datasets and predicates are available that is not solving our use case. So is it possible to create a new datasets in the same format subject, object and predicates, bbox. …

message-simple
6

Data Augmentation in structured categorical dataset.

I am working with categorical dataset such as 'Adult'. I want to augment the data using weak supervision. Any leads will be helpful.

message-simple
3

Struggle to put all the pieces together in the snorkel pipeline

Struggle to put all the pieces together in the snorkel pipeline Hello! I’m using snorkel in my master thesis on textual data to categorize comments into a set of different categories. I have gone through the tutorials on the snorkel tutorial github page but still struggles…

message-simple
2

Data Slicing

Hi! I applied the labeling and transformation functions on my 3 million of sentences. Now, I’m trying to apply the SliceAwareClassifier but it needs more than 12GB of RAM for only 5000 sentences, is it normal? Are there other models to improve the slice performance?

message-simple
1

How to use Snorkel on numeric data?

Hi - I'm trying to see if I can use Snorkel to label my numeric training dataset (and then looking to either do label classification or convert the labels to scores and do a regression classification). My question is if there are coding examples anywhere that show how to use…

message-simple
1

Widely varying results when training multi-task model on same inputs

Hello! I'm in the process of writing a paper comparing the efficacy of multi-task models with weak supervision to the performance of their single-task equivalents, and I've been using Snorkel v0.9.6 to explore this. However, I've been experiencing a strange issue in that when…

message-simple
2

Where to find this util package?

I am trying to follow this https://www.snorkel.org/get-started/. I am not able to locate from utils import load_unlabeled_spam_dataset. Can someone guide me on where to get this package? Appreciate your help and time.

message-simple
2

Why the probabilities are not higher in the positive label group?

Hi, I used LabelModel to label my train set, and I also obtained probability by using label_model.predict(L=L_train, tie_break_policy="abstain", return_probs=True). I assume that the probabilities should be relatively higher in the positive label group. However, that is not what…

message-simple
4

Is it possible to assign weights on label functions

Hi, dear Snorkel community, I am relatively new to using Snorkel. My purpose is to use Snorkel to generate labels for a dataset we got. So far, we came up with dozens of rules, which are mainly based on domain knowledge of the data. Domain knowledge gave us hints that these…

message-simple
3