crowdkit.aggregation.classification.gold_majority_vote.GoldMajorityVote.fit_predict_proba | Source code
fit_predict_proba(self,data: DataFrame,true_labels: Series)
Fits the model to the training data and returns probability distributions of labels for each task.
The training dataset of workers' labeling results which is represented as the
The ground truth labels of tasks. The
Probability distributions of task labels.
pandas.DataFrame data is indexed by
task so that
result.loc[task, label] is the probability that the
task true label is equal to
Each probability is in he range from 0 to 1, all task probabilities must sum up to 1.