Image Captioning using CLIP and Marian
How we combined CLIP and Marian to build an image captioning model for Indonesian.
Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text.
Building this static site generator theme was the first time I used an Atomic (or Functional) CSS system like Tachyons. It’s a design system that provides very small (which means fast) CSS modules that you can use in your HTML.
A language model computes the probability of a sentence (a sequence of words) or the probability of the next word in a sequence.
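To make the two probabilities concrete, here is a minimal sketch of a bigram language model estimated by simple counting. The toy corpus, the `<s>`/`</s>` boundary markers, and all function names are assumptions for illustration; practical language models use smoothing or neural networks rather than raw counts.

```python
from collections import Counter

def train_bigram(corpus):
    """Count context words and bigrams over whitespace-tokenized sentences."""
    contexts, bigrams = Counter(), Counter()
    for sentence in corpus:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        contexts.update(tokens[:-1])             # every token that has a successor
        bigrams.update(zip(tokens, tokens[1:]))  # adjacent (previous, next) pairs
    return contexts, bigrams

def next_word_prob(contexts, bigrams, prev, word):
    """Maximum-likelihood estimate of P(word | prev); 0.0 for unseen contexts."""
    if contexts[prev] == 0:
        return 0.0
    return bigrams[(prev, word)] / contexts[prev]

def sentence_prob(contexts, bigrams, sentence):
    """Probability of a whole sentence as a product of next-word probabilities."""
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    prob = 1.0
    for prev, word in zip(tokens, tokens[1:]):
        prob *= next_word_prob(contexts, bigrams, prev, word)
    return prob

# Hypothetical toy corpus, used only to exercise the functions.
corpus = ["the cat sat", "the dog sat", "the cat ran"]
contexts, bigrams = train_bigram(corpus)

p_next = next_word_prob(contexts, bigrams, "the", "cat")
p_sentence = sentence_prob(contexts, bigrams, "the cat sat")
```

The sentence probability is just the chain of next-word probabilities multiplied together, which is why the two definitions describe the same model.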
Machine Translation is the task of automatically converting one natural language into another, preserving the meaning of the input text, and producing fluent text in the output language.
Building systems that automatically answer questions posed by humans in natural language.
Text classification is the process of labeling or organizing text data into groups. It forms a fundamental part of natural language processing.
Summarization is the task of producing a shorter version of one or several documents that preserves most of the input's meaning.
Token classification is similar to text classification, except that each token in the text receives its own prediction. A common use of this task is named entity recognition (NER).
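The difference in output shape can be sketched with a toy lookup-based tagger: one label per token rather than one label for the whole text. The `GAZETTEER` dictionary, its labels, and the `tag_tokens` helper are hypothetical and exist only for illustration; real NER systems use trained models, not a fixed word list.

```python
# Hypothetical word-to-label lookup; "O" marks tokens outside any entity.
GAZETTEER = {"jakarta": "LOC", "indonesia": "LOC"}

def tag_tokens(text):
    """Return one (token, label) pair per whitespace-separated token."""
    pairs = []
    for token in text.split():
        key = token.strip(".,!?").lower()  # ignore case and trailing punctuation
        pairs.append((token, GAZETTEER.get(key, "O")))
    return pairs

tagged = tag_tokens("Jakarta is in Indonesia.")
```

A text classifier would return a single label for the same sentence, whereas here the result has as many labels as there are tokens.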