Toloka documentation

fit_predict_proba

crowdkit.aggregation.classification.majority_vote.MajorityVote.fit_predict_proba | Source code

fit_predict_proba(
    self,
    data: DataFrame,
    skills: Optional[Series] = None
)

Fit the model and return probability distributions on labels for each task.

Parameters Description

Parameters Type Description
data DataFrame

Workers' labeling results. A pandas.DataFrame containing task, worker and label columns.

skills Optional[Series]

workers' skills. A pandas.Series index by workers and holding corresponding worker's skill

  • Returns:

    Tasks' label probability distributions. A pandas.DataFrame indexed by task such that result.loc[task, label] is the probability of task's true label to be equal to label. Each probability is between 0 and 1, all task's probabilities should sum up to 1

  • Return type:

    DataFrame