fit_predict_proba

crowdkit.aggregation.classification.majority_vote.MajorityVote.fit_predict_proba | Source code

fit_predict_proba(
self,
data: DataFrame,
skills: Optional[Series] = None
)

Fits the model to the training data and returns probability distributions of labels for each task.

Parameters description

ParametersTypeDescription
dataDataFrame

The training dataset of workers' labeling results which is represented as the pandas.DataFrame data containing task, worker, and label columns.

skillsOptional[Series]

The workers' skills. The pandas.Series data is indexed by worker and has the corresponding worker skill.

  • Returns:

    The probability distributions of task labels. The pandas.DataFrame data is indexed by task so that result.loc[task, label] is the probability that the task true label is equal to label. Each probability is in the range from 0 to 1, all task probabilities must sum up to 1.

  • Return type:

    DataFrame

Last updated: March 31, 2023

Crowd-Kit
Overview
Reference
Aggregation
Datasets
Learning
Metrics
Postprocessing