fit_predict_proba

crowdkit.aggregation.classification.gold_majority_vote.GoldMajorityVote.fit_predict_proba | Source code

fit_predict_proba(
    self,
    data: DataFrame,
    true_labels: Series
)

Fits the model to the training data and returns probability distributions of labels for each task.

Parameters description

Parameters	Type	Description
`data`	DataFrame	The training dataset of workers' labeling results which is represented as the `pandas.DataFrame` data containing `task`, `worker`, and `label` columns.
`true_labels`	Series	The ground truth labels of tasks. The `pandas.Series` daata is indexed by `task` so that `labels.loc[task]` is the task ground truth label.

Returns:

Probability distributions of task labels. The pandas.DataFrame data is indexed by task so that result.loc[task, label] is the probability that the task true label is equal to label. Each probability is in he range from 0 to 1, all task probabilities must sum up to 1.
Return type:

DataFrame

Last updated: March 31, 2023

Crowd-Kit

Reference

Aggregation

Datasets

Learning

Metrics

Postprocessing