How AI helps moderation at Kolesa Group, and what aliens have to do with it

How companies can hand monotonous tasks to AI to streamline employees' work and help the business scale

Spam, scammers, prohibited goods and services, and joke or incorrect listings will drive any user away from a service or app. A business that wants to grow its customer base does everything it can to make sure only relevant, useful information reaches the user.

Listings for buying and selling cars, real estate, goods, and services are the heart of Kolesa Group's IT products. Every day the company's users submit 83,000 listings, each of which has to be checked for correctness. Who handles such a huge flow of information, and how?

We spoke with Indira Aldenova, head of the moderation department at Kolesa Group. The department has 39 people. Every year the flow of listings increases and the business grows, yet the department's headcount grows only minimally. How do they manage to scale and fight incorrect listings more effectively without inflating the staff? Drawing on her own experience, she describes how AI (artificial intelligence) was introduced into moderation and what results it brought. The rest is in her words.

How manual moderation works and why it matters

Everything that comes into the service is checked: listings, reviews, comments, user accounts, and applications from specialists.

Manual moderation follows a classic scheme.

1. A listing goes to a moderator for review. The moderator asks questions such as:

- Is there anything in this photo that should not be visible?

- Does this message contain prohibited words or private information?

2. If violations are found, the moderator can decide how to fix them or reject the listing.

Manual moderation might seem simpler and cheaper, but cost is not the point. What matters most in manual moderation is accumulating historical data and building a system for working with that data, which will form the basis of a future ML model.

Tip 1
Collect data into datasets from the very beginning, because you can build ML models on top of them. We recommend collecting at least 10,000 data points before starting work on ML. A data point is a photo, a text, or any other unit of data.
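One way to follow this tip is to log every moderator decision as a data point from day one. The sketch below is only an illustration: the field names, the JSON Lines format, and the helper names are assumptions, not a description of Kolesa Group's actual storage.

```python
import json
from dataclasses import dataclass, asdict
from io import StringIO

MIN_DATA_POINTS = 10_000  # minimum recommended in Tip 1


@dataclass
class DataPoint:
    # All field names here are hypothetical.
    listing_id: int
    kind: str      # "photo" or "text"
    content: str   # the text itself, or a path to the photo
    decision: str  # moderator's decision, e.g. "approved" or "rejected"
    reason: str    # why the moderator decided this way


def append_data_point(stream, point: DataPoint) -> None:
    """Write one data point as a single JSON line."""
    stream.write(json.dumps(asdict(point), ensure_ascii=False) + "\n")


def ready_for_ml(count: int, minimum: int = MIN_DATA_POINTS) -> bool:
    """True once enough data points have been collected to start on ML."""
    return count >= minimum


# Usage: an in-memory stream stands in for a real file or database.
log = StringIO()
append_data_point(log, DataPoint(1, "text", "Selling a car", "approved", ""))
append_data_point(log, DataPoint(2, "photo", "photos/2.jpg", "rejected", "screenshot"))
```

Keeping the moderator's reason next to the decision means the same log can later serve as labels for several different models.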

How automoderation works

Because our work with manual moderation is clearly structured, we are not building the automoderation process from scratch: it runs in parallel.

We label data for the ML models. Why: the more examples a model "sees" of what we consider right or wrong, the better it learns, and the larger the share of work it can take on in the future.

Automoderation works on text and photos.

Text automoderation is kept up to date through a list of stop words and a choice of parameters. For example:

- One model determines whether the listing's category is correct, so that spare parts are not posted under cars;

- Another model checks the listing text for unwanted words;

- A third model checks whether the listed price is plausible, since scammers often hide behind cheap listings. And so on.
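Two of the text checks above can be illustrated with simple rules. This is a simplified stand-in, not the ML models the article describes: the stop-word list, the median-based price rule, and the `min_ratio` threshold are all assumptions made for the example.

```python
import re
from statistics import median

# Hypothetical stop-word list; a real one would be maintained by moderators.
STOP_WORDS = {"casino", "weapon"}


def find_stop_words(text: str, stop_words=STOP_WORDS) -> list[str]:
    """Return the stop words that occur in the text, sorted alphabetically."""
    tokens = re.findall(r"\w+", text.lower())
    return sorted(set(tokens) & set(stop_words))


def price_is_suspicious(price: float, comparable_prices: list[float],
                        min_ratio: float = 0.5) -> bool:
    """Flag a price far below the median of comparable listings.

    min_ratio is an assumed threshold: 0.5 flags anything cheaper than
    half the median price.
    """
    if not comparable_prices:
        return False  # nothing to compare against, so do not flag
    return price < min_ratio * median(comparable_prices)
```

A flagged price would not prove fraud; in the spirit of the article it would only be a reason to send the listing to a human moderator.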

Photo moderation consists of 8 different ML models: people, screenshots, duplicates, weapons, and so on. It works as follows:

1. The user submits a listing.

2. Automoderation downloads all of the listing's photos.

3. It determines the order in which the models check the photos.

4. If at any stage a model finds a problem in a photo, further checking stops and the result is returned.

Possible results:

- publish

- send to manual moderation.

Automoderation does not reject photos on its own.
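The steps above can be sketched as a short-circuiting pipeline. The names `Result`, `Check`, and `moderate_photos` are hypothetical, the checks are trivial stand-ins for the ML models, and how the order of models is chosen is not described in the article, so here the caller simply passes the checks in order.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence

# A check takes photo bytes and returns a hint string if it finds a
# problem, or None if the photo looks fine to that model.
Check = Callable[[bytes], Optional[str]]


@dataclass
class Result:
    action: str                 # "publish" or "manual"; never "reject"
    hint: Optional[str] = None  # what the model found, if anything


def moderate_photos(photos: Sequence[bytes], checks: Sequence[Check]) -> Result:
    """Run the checks in order and stop at the first problem found."""
    for photo in photos:
        for check in checks:
            hint = check(photo)
            if hint is not None:
                # Automoderation does not reject; it escalates to a human.
                return Result("manual", hint)
    return Result("publish")


# Stand-in checks that record the order in which they were called.
calls: list[str] = []


def people_check(photo: bytes) -> Optional[str]:
    calls.append("people")
    return None


def screenshot_check(photo: bytes) -> Optional[str]:
    calls.append("screenshot")
    return "Screenshot detected" if photo.startswith(b"SCREEN") else None


def weapon_check(photo: bytes) -> Optional[str]:
    calls.append("weapon")
    return None
```

Because the function returns as soon as one check fires, later and possibly more expensive models never run on a photo that is already headed to manual moderation.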

But the AI does not always know what to do with a listing. In that case the listing is sent to manual moderation. Automoderation takes on 85% of the entire flow of listings.

Tip 2
Make the hints from automoderation thorough and clear. Hints such as "Animals detected" tell us exactly what confused the AI during the check. The more informative automoderation is when it sends listings to manual moderation, the more effective the data collection for retraining the models will be.
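A hint becomes more useful when it carries structured detail as well as a readable message. The sketch below assumes a hypothetical `Hint` record with a model name, a label, and a confidence score; none of these fields are confirmed by the article, which only mentions messages such as "Animals detected".

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Hint:
    # Hypothetical fields for illustration.
    model: str         # which model raised the hint
    label: str         # what it thinks it saw
    confidence: float  # model score between 0 and 1

    def message(self) -> str:
        """Human-readable text shown to the moderator."""
        return f"{self.label} detected by {self.model} (confidence {self.confidence:.2f})"


def most_common_labels(hints: list[Hint], top: int = 3) -> list[tuple[str, int]]:
    """Count hints per label to see which model needs retraining most."""
    return Counter(h.label for h in hints).most_common(top)


# Usage with made-up hints.
hints = [
    Hint("animal_detector", "Animals", 0.87),
    Hint("animal_detector", "Animals", 0.64),
    Hint("weapon_detector", "Weapons", 0.71),
]
```

Counting hints by label gives a rough view of where the models are most often unsure, which is one possible way to prioritize retraining data.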

Working cases

Let's look at automoderation using the product Kolesa.kz as an example. Automoderation's main failures involve photos:

Case 1: UFOs

Recently we had an "alien invasion". Users submitted joke listings, and the photos went viral on social media. The cause: photos that the AI cannot recognize.

Case 2: the AI recognizes spare parts as animals or weapons.

We collect such cases and use them to retrain the model.

Optimizing team resources through automation

The main goal of introducing automoderation is to free up the team's resources for:

• more thorough checking of suspicious listings to protect users from scammers;

• improving the quality of listings;

• faster, higher-quality handling of user requests.

Thanks to automation, moderators have been able to focus on more intellectual work. Specifically, they:

1. Help analyze moderation metrics to review cases and improve efficiency.

2. Run master classes, workshops, and experience-sharing sessions so that there are as many high-quality listings as possible.

3. Increase the speed of listing checks.

4. Take part in growth and development programs, which give them the opportunity to grow into team leads or move into another area of IT.

Quality of manual moderation: 99.9%.

Quality of automoderation: 99.4%.

More than 60 models run in Kolesa Group products. Work on retraining the ML models and on improving the quality and efficiency of automoderation is ongoing.

Results

1. Cut listing check time from 15 minutes to a couple of seconds.

2. Cut manual moderation time from 15 to 10 minutes.

3. Reduced the load on moderators: automoderation takes on 85% of text and photos.

4. By automating routine processes, we were able to reallocate resources to other tasks.
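To put the 85% figure in perspective, the short calculation below applies it to the 83,000 daily listings mentioned at the start of the article. It assumes both numbers describe the same flow, which the article does not state explicitly; `split_flow` is a hypothetical helper.

```python
def split_flow(total: int, auto_percent: int) -> tuple[int, int]:
    """Split a daily flow into automated and manual parts (integer math)."""
    automated = total * auto_percent // 100
    return automated, total - automated


# Figures from the article: 83,000 listings per day, 85% automated.
automated, manual = split_flow(83_000, 85)
print(f"automated: {automated}, manual: {manual}")
```

Under that assumption, roughly 70,550 listings a day would pass through automoderation and about 12,450 would still reach human moderators.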

Plans

1. Improve the quality of automoderation.

2. Analyze errors and retrain the models using labeled images and text.

3. Train the models on the main triggers found in scammers' listings and send such listings to manual moderation for an additional check.

When introducing AI into your processes, it is important to remember that keeping it effective is a continuous process of analyzing and working through errors.


Shudanbekov Zhiger
Dec. 28, 2023 08:55
