Long-Term Internship Program: Vision-Language Model Development Intern (ICL_W1/W2)
This internship is in ITRI's Information and Communications Research Laboratories (ICL).
This is an exciting long-term internship opportunity. We’re looking for candidates who can commit to around 6 months. You’ll join a dynamic team, gain hands-on experience, and work on impactful projects that will help you develop valuable skills for your future career.
Job Description:
1、Data Collection and Integration: Assist in collecting and integrating vision-language model (VLM) training data, including images, text, and corresponding descriptions, ensuring data diversity and quality.
2、Data Cleaning and Processing: Clean and format collected data to meet training requirements and ensure consistency.
3、Data Annotation: Perform semantic annotation of images and text according to established guidelines, supporting the accuracy of cross-modal learning.
4、Data Verification: Check the accuracy of annotated data, assist in identifying and correcting potential errors, and improve data quality.
5、Model Training Support: Assist in running and testing pre-trained VLMs, provide feedback based on data characteristics, and contribute to model optimization.
Qualifications:
1、Current undergraduate or graduate student, preferably in Computer Science, Artificial Intelligence, or related fields.
2、Proficiency in Python programming.
3、Basic understanding of the data annotation process, with the ability to follow annotation guidelines and ensure accuracy. Prior experience in image or text annotation is a plus.
4、Familiarity with deep learning frameworks (e.g., TensorFlow, PyTorch) is a plus.
5、Required application documents: a transcript and a recommendation letter.