ICL Intern_Vision-Language Model Development (W100)
This internship is in ITRI's Information and Communications Research Laboratories (ICL).
Job Description
1. Data Collection and Integration: Assist in collecting and integrating visual-language model (VLM) training data, including images, text, and corresponding descriptions, ensuring data diversity and quality.
2. Data Cleaning and Processing: Clean and format collected data to meet training requirements and ensure consistency.
3. Data Annotation: Perform semantic annotation of images and text according to established guidelines, supporting the accuracy of cross-modal learning.
4. Data Verification: Check the accuracy of annotated data, assist in identifying and correcting potential errors, and improve data quality.
5. Model Training Support: Assist in running and testing pre-trained VLM models, provide feedback based on the data characteristics, and contribute to model optimization.
Qualifications:
1. Current undergraduate or graduate student, preferably in Computer Science, Artificial Intelligence, or related fields.
2. Proficiency in Python programming.
3. Basic understanding of the data annotation process. Prior experience in image or text annotation is a plus, with the ability to follow annotation guidelines and ensure accuracy.
4. Familiarity with deep learning frameworks (e.g., TensorFlow, PyTorch) is a plus.