fit_predict_proba

crowdkit.aggregation.classification.gold_majority_vote.GoldMajorityVote.fit_predict_proba | Source code

fit_predict_proba(
self,
data: DataFrame,
true_labels: Series
)

Fits the model to the training data and returns probability distributions of labels for each task.

Parameters description

ParametersTypeDescription
dataDataFrame

The training dataset of workers' labeling results which is represented as the pandas.DataFrame data containing task, worker, and label columns.

true_labelsSeries

The ground truth labels of tasks. The pandas.Series daata is indexed by task so that labels.loc[task] is the task ground truth label.

  • Returns:

    Probability distributions of task labels. The pandas.DataFrame data is indexed by task so that result.loc[task, label] is the probability that the task true label is equal to label. Each probability is in he range from 0 to 1, all task probabilities must sum up to 1.

  • Return type:

    DataFrame

Last updated: March 31, 2023

Crowd-Kit
Overview
Reference
Aggregation
Datasets
Learning
Metrics
Postprocessing