Université Lumière Lyon 2
AI

Generative AI methods to model Printing-and-Digitalization process

Université Lumière Lyon 2 · Lyon, ARA, FR · $135k

Actively hiring Posted 5 months ago

Generative AI methods to model Printing-and-Digitalization process

Réf ABG-135313

Stage master 2 / Ingénieur

Durée 4 mois

Salaire net mensuel 4.5 euros per hour

28/01/2026

Université Lumière Lyon 2

Lieu de travail

Lyon Auvergne-Rhône-Alpes France

Champs scientifiques

  • Informatique

Mots clés

Generative learning, image processing, printed documents

Date limite de candidature

28/02/2026

Établissement recruteur

Founded in 1973, Université Lumière Lyon 2 welcomes nearly 30,000 students on its two campuses, ranging from undergraduate to doctoral level.

As a university of literature, languages, and human and social sciences, it is comprised of 13 teaching units spread over four main areas of teaching and research.

With 33 laboratories and four research federations, which cover the areas of literature, languages, and human and social sciences (LLSHS – Lettres, Langues, Sciences Humaines et Sociales), Université Lumière Lyon 2 bases its approach on innovation, interdisciplinarity, partnership and an international outlook.

Through the projects developed and coordinated by its 1000 researchers, the university would like to enable communication and discussion between the human and social sciences, on one hand, and the hard sciences, on the other, as well as to put research at the centre of current societal and scientific challenges.

Université Lumière Lyon 2 has a strong focus on international cooperation and currently has agreements with 350 institutions throughout the world. International students, whether part of an exchange programme or otherwise, account for more than 15% of the overall student body.

Description

Context of the study

Due to development and broad availability of high-quality printing and scanning devices, the number of forged or counterfeited products and documents is dramatically increasing. Therefore, there is continuous research of new solutions to protect the documents and valuable products. One of the promising and accessible solution is the use of security printing. When an electronic document is printed and scanned one/several times, a slightly different image of the document - due to the optical characteristics of the capture devices - is obtained each time. We can use this information loss principle to identify the devices used for production of the document hardcopy.

In this case, the authentication/identification systems take a full advantage of Printing-and-Digitalization (PD) process. In current stage, we need to print a big quantity of samples to learn the authentication detectors or forgery detectors. Nevertheless, the dataset construction process is very expensive and time-consuming. Additionally, the data collection process requires dedicated personnel and very strict procedures. Therefore, we are looking for solutions that can help to learn a surrogate model of PD process and create a big public dataset of printed documents.

This internship is a part of ANR project TRUSTIT: Theoretical and practical study of physical object security in real world use cases that aims to explore the potential offered by deep learning methods in the context of secure printing from the verifier's point of view.

Description of the subject

The generative neural networks (GANs) and probabilistic latent diffusion models recently have showed their efficiency in data generation and style transfer [1,2]. The main tasks of this internship are:

1) To learn a surrogate representation of one pair printer-scanner using existing large dataset of L3iTextCopies [3].

2) To experiment with different architectures of GANs and probabilistic diffusion approaches to identify the best method for our task.

3) To compare the pseudo-synthetic samples with real printed documents using some commonly used metrics as Pearson correlation, Mean square error (MSE) distance and Fréchet Inception Distance (FID) between the datasets [4].

4) To evaluate the possibility of fine-tuning the proposed models for unseen pairs of printer and scanner.

5) To create a public synthetic dataset of printed documents and if possible, to publish the results in the international conference or scientific journal.

Place and allowance of internship

The internship will be held in LIRIS (Laboratoire d’Informatique en Image et Systèmes d’information) laboratory, campus of Université Lumière Lyon 2, Bron. Internship allowance is 4.5 euros per hour.

References

[1] R. Yadav, I. Tkachenko, A. Trémeau, T. Fournel, Copy Sensitive Graphical Code Estimation: Physical vs Numerical Resolution, IEEE WIFS 2019.

[2] R. Ratajczak, C. Crispim-Junior, and al., Pseudo-Cyclic Network for Unsupervised Colorization with Handcrafted Translation and Output Spatial Pyramids, SUMAC - ACM Multimedia 2019.

[3] S. Eskenazi, P. Gomez-Krämer, J-M. Ogier, A Study of the Factors Influencing OCR Stability for Hybrid Security. IWCDF-ICDAR 2017: 3-8.

[4] Y. Belousov, B. Pulfer, R. Chaban, J. Tutt, O. Taran, T. Holotyak, S. Voloshynovskiy, “Digital twins of physical printing-imaging channel”, IEEE WIFS 2022.

Profil

  • The candidate must currently be enrolled in a Master 1 or Master 2 program or in the final year of engineering school (that corresponds to Bac+5 in France) in Computer Science.
  • Programming languages: Python.
  • Libraries for image analysis and processing: OpenCV, scikit-image (Python).
  • Machine learning frameworks: scikit-learn, PyTorch.
  • Scientific knowledge: signal processing, image analysis, machine learning and deep learning.
  • Knowledge on multimedia security will be considered a plus.
  • Languages: French or English.

Prise de fonction

01/04/2026

Tags & focus areas

Used for matching and alerts on DevFound
Ai Generative Ai
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.