fit_predict_proba

crowdkit.aggregation.classification.majority_vote.MajorityVote.fit_predict_proba | Source code

fit_predict_proba(
    self,
    data: DataFrame,
    skills: Optional[Series] = None
)

Fits the model to the training data and returns probability distributions of labels for each task.

Parameters description

Parameters	Type	Description
`data`	DataFrame	The training dataset of workers' labeling results which is represented as the `pandas.DataFrame` data containing `task`, `worker`, and `label` columns.
`skills`	Optional[Series]	The workers' skills. The `pandas.Series` data is indexed by `worker` and has the corresponding worker skill.

Returns:

The probability distributions of task labels. The pandas.DataFrame data is indexed by task so that result.loc[task, label] is the probability that the task true label is equal to label. Each probability is in the range from 0 to 1, all task probabilities must sum up to 1.
Return type:

DataFrame

Last updated: March 31, 2023

Crowd-Kit

Reference

Aggregation

Datasets

Learning

Metrics

Postprocessing