crowdkit.aggregation.classification.gold_majority_vote.GoldMajorityVote.fit_predict_proba
| Source code
fit_predict_proba( self, data: DataFrame, true_labels: Series)
Fits the model to the training data and returns probability distributions of labels for each task.
Parameters | Type | Description |
---|---|---|
data | DataFrame | The training dataset of workers' labeling results which is represented as the |
true_labels | Series | The ground truth labels of tasks. The |
Returns:
Probability distributions of task labels.
The pandas.DataFrame
data is indexed by task
so that result.loc[task, label]
is the probability that the task
true label is equal to label
.
Each probability is in he range from 0 to 1, all task probabilities must sum up to 1.
Return type:
DataFrame
Last updated: March 31, 2023