A young woman with a water pitcher
Outline
- Painting information
- Meaning of the painting
- Details in the painting
- The face, the girl
- The wall map
- The pitcher and basin
- The window
Painting information
-
This picture of the early to mid-1660s is characteristic of Vermeer’s mature style.
-
Vermeer gets a quiet balance of primary colors and simple shapes
-
The composition suits the theme of domestic, tranquility, underscored by the pitcher, traditional symbol of purity.
-
This canvas was the first of thirteen paintings by Vermeer to enter the United States between 1887 and 1919.
The meaning
- The difference between outside and inside.
- The window and the room.
- The marriage and the freedom.
- The window -> freedom
- The room -> marriage
The face
- She is somewhat inexpressive.
- A similar white cap in other paintings.
- The most characteristic -> blue colour of the headdress´ shadows.
- Neutral gray.
The wall map
- The Seventeen Provinces of the Netherlands in 1671.
- Real map.
- Vermeer intrduces a few decorative themes to give it a new look.
The pitcher and basin
- It plays a fundamental role in the composition. Maps were used as a decorative element in Dutch houses.
- The realism of both of them in the house life.
- This model of pitcher could be owned by Vermeer´s mother in law. A present for her Daugther.
The window
- It seems to be the same as in the “Music Lesson”
- Some observers believe that the girl is watering the flowers.
- The glass of the window is broken down to make a mosaic.
Web pages information
- www.essentialvermeer.com
- www.terraingallery.org/ Vermeer-Woman-Jug- JJ-RJ.htm
CREA & CORDE
CREA It is a Corpus of Reference of the current Spanish.It has all the variantions which Spanish can have nowadays.This corpus has a mixture of written and oral texts. These texts are from 1975 until our age.- Its structure of work is the following:
- It determines the dimension of the corpus
- It can classify texts in a chronological way, spatial, etc.
- Acquisition of texts
- It can classify texts:
CORDE What made that RAE created this corpus, CORDE, was the good results obtained guring the first months of its project. CORDE consists on something more of 300 million forms which proceeded from texts from the origin of language until 1974. Bibliography http://corpus.rae.es/cordenet.html http://corpus.rae.es/ayuda_c.htm http://www4.ujaen.es/~ncontrer/interele/links.htm http://216.239.59.104/search?q=cache:vYr6ppD3loUJ:paginaspersonales.deusto.es/abaitua/konzeptu/ta/soria00.ppt+crea+corde&hl=es&ct=clnk&cd=9&gl=es
Draft of the final project
My group is the A one and we chose a topic ” Review of Spanish Corpora” to make it.
Firstly, we had to find information about what was a corpus and how the use. With this information we made two parts in the final project: a theoric part, explaining what and how a corpus works and a parctical part, where we have put this information.
- Theoric part_ We had writen about the porpuses of a corpus, different corpuses
- Practical part_ We explain how the Copus works with the Mark Davies´s Corpus application and with the RAE Corpus.
Cluvi
The French-Galician CLUVI Dictionary, according to its web page is ”developed by the SLI (Computational Linguistics Group of the University of Vigo), is a bilingual dictionary based on a representative corpus of French texts with their Galician translation. That parallel corpus is part of the Linguistic Corpus of the University of Vigo (CLUVI). All the French words in the dictionary are found in the CLUVI Corpus, as well as all of the Galician translations. Each of the selected bilingual equivalences is illustrated with a real usage example as found in the CLUVI Corpus. If you want to obtain more examples, please use the CLUVI web interface at http://sli.uvigo.es/CLUVI/. This web application permits both simple and very complex searches of isolated words or sequences of words, and shows the multilingual equivalences of the terms in context, as found in real and referenced translations. Version 0.5 of this dictionay (1173 entries, 2604 translations) contains the French-Galician lexical equivalences with highest frequency in the CLUVI. In future versions, the French-Galician CLUVI Dictionary will be completed with more French entries and more French-Galician equivalences from the CLUVI.”
Cluvi has its main sections like:
- LEGA Corpus of Galician-Spanish legal texts (6.329.655 words) [Search] [Composition]
- UNESCO Corpus of English-Galician-French-Spanish scientific-technical divulgation (3.724.620 words) [Search] [Composition]
- LOGALIZA Corpus of English-Galician software localization (2.375.157 words) [Search] [Composition
- TECTRA Corpus of English-Galician literary texts
- LEGE-BI Corpus of Basque-Spanish legal texts
- TURIGAL Corpus of Portuguese-English tourism texts
Bibliography
http://sli.uvigo.es/CLUVI/corpus.html
http://sli.uvigo.es/CLFG/info.html
http://www.spaniards.es/comment/197694
Types of corpus
There have been authors like J. Sinclair or J. Torruella and J. Llisterri, who have proposed different classification for different corpuses. For stablishing corpuses, it has to have account for some characteristics:
- Modality of language
- The size of texts
- The age which texts have
- The treatment applied to the corpus: information…
Often, these criteria come determined by the purpose that is chased by the corpus:
- the study of the work of an author
- the description of a language in general
- sublanguage or linguistic concrete aspect
- the obtaining of a certain commercial product
Bibliography
Definition of corpus
According to Wikipedia, a corpus is:
“In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (now usually electronically stored and processed). They are used to do statistical analysis, checking occurrences or validating linguistic rules on a specific universe.” (Wikipedia)
There are several types of corpus. They are subjected to a process known as annotation
English language:
- American National Corpus
- Bank of English
- British National Corpus
- Corpus Juris Secundum
- Corpus of American English 360 million words, 1990-2007. Freely available online.
- Brown Corpus, forming part of the “Brown Family” of corpora, together with LOB, Frown and F-LOB.
- Oxford English Corpus
- Scottish Corpus of Texts & Speech
Other languages:
- Amarna letters, (for Akkadian, Egyptian, Sumerogram’s, etc.)
- Bijankhan Corpus A Contemporary Persian Corpus for NLP researches
- Croatian National Corpus
- Hamshahri Corpus A Contemporary Persian Corpus for IR researches
- Neo-Assyrian Text Corpus Project
- Persian Today Corpus
- Thesaurus Linguae Graecae (Ancient Greek)
Bibliography
European Language Resources Association

The European Language Resources Association (ELRA) was established in Luxembourg in February, 1995. ELRA is the driving force to make available the language resources for language engineering and to evaluate language engineering technologies.
We have to mention what the language resources is. The term language resources refers to a set of speech or language data and descriptions in machine readable form, used e.g. for building, improving or evaluating natural language and speech algorithms or systems, or, as core resources for the software localisation and language services industries, for language studies, electronic publishing, international transactions, subject-area specialists and end users. There are several uses in the Language Resources such as:
-
They can extend their market segments, reaching a wider audience, and making their products more accessible.
-
They can increase cost-effectiveness.
-
They can improve internal and external communication.
These are some of the uses of the Languages Resources.
One of the mission of ELRA is to promote language resources for the Human Language Technology (HLT) sector; othe mission of this driving force is evaluate language engineering technologies. This two missions can be introduced and described in the “Services around Language Resources” section. In this section we can find several points of analysis:
- Identification of language resources
– Promotion of the production of language resources
- Production of language resources
- Validation of language resources
- Evaluation of systems, products, tools, etc., related to language resources
- Distribution of language resources
- Standardisation
ELRA also regularly conducts market studies and surveys in the field of HLT, and publishes a quarterly newsletter, distributed not only to its members but also to a large number of people in the HLT community.
ELRA has several services which their actual implementation of these services is undertaken by ELRA’s distribution agency, ELDA in which we can find this services:
● The catalogue of Resources
● Identification and distribution of LRs
● Legal assistance
● Production of new LRs on demand
If someone wants to contact with them or see this page link here.
Bibliography
● ELRA home page
Ontology
In both computer science and information science, an ontology is a data model that represents a set of concepts within a domain and the relationships between those concepts. It is used to reason about the objects within that domain.
Ontologies are used in artificial intelligence, the Semantic Web, software engineering, biomedical informatics, Library Science, and information architecture as a form of knowledge representation about the world or some part of it. Common components of ontologies include:
- Individuals: the basic or “ground level” objects
- Classes: sets, collections, or types of objects[1]
- Attributes: properties, features, characteristics, or parameters that objects can have and share
- Relations: ways that objects can be related to one another
- Function terms: complex structures formed from certain relations that can be used in place of an individual term in a statement
- Restrictions: formally stated descriptions of what must be true in order for some assertion to be accepted as input
- Rules: statements in the form of an if-then (antecedent-consequent)sentence that describe the logical inferences that can be drawn from an assertion in a particular form
- Axiom: assertions (including Rules)in a logical form that together comprise the overall theory that the ontology describes, for its domain of application.
- Events: the changing of attributes or relations
Ontology languages
An ontology language is a formal language used to encode the ontology. There are a number of such languages for ontologies, both proprietary and standards-based:
- OWL is a language for making ontological statements, developed as a follow-on from RDF and RDFS, as well as earlier ontology language projects including OIL, DAML and DAML+OIL. OWL is intended to be used over the World Wide Web, and all its elements (classes, properties and individuals) are defined as RDF resources, and identified by URIs.
- KIF is a syntax for first-order logic that is based on S-expressions.
- The Cyc project has its own ontology language called CycL, based on first-order predicate calculus with some higher-order extensions.
- Rule Interchange Format (RIF) and F-Logic combine ontologies and rules.
In order to work with Ontology Languages, there are some useful technologies like Ontology Editor (to create ontologies using one of these languages), Ontology DBMS (to store and to query an ontology) and Ontology Warehouse (to integrate and to explore a set of related ontologies).
Ontology libraries
The development of ontologies for the Web has led to the apparition of services providing lists or directories of ontologies with search facility. Such directories have been called ontology libraries.
The following are static libraries of human-selected ontologies.
- The DAML Ontology Library maintains a legacy of ontologies in DAML.
- SchemaWeb is a directory of RDF schemas expressed in RDFS, OWL and DAML+OIL.
The following are both directories and search engines. They include crawlers searching the Web for well-formed ontologies.
- Swoogle is a directory and search engine for all RDF resources available on the Web, including ontologies.
- The OntoSelect Ontology Library offers similar services for RDF/S, DAML and OWL ontologies.
- Ontaria is a “searchable and browsable directory of semantic web data”, with a focus on RDF vocabularies with OWL ontologies.
BibTex
Es una herramienta para dar formato a listas de referencias que se utiliza habitualmente con el sistema de preparación de documentos LaTeX.
Información bibliografica
BibTeX usa un formato de archivo basado en texto e independiente del estilo para listas de ítems de bibliografía, como artículos, libros, tesis. Los archivos de bibliografía de BibTeX usualmente terminan en .bib. Los ítems bibliográficos incluidos en un .bib están separados por tipos. Los tipos siguientes son reconocidos por virtualmente todos los estilos de BibTeX:
- article: Un artículo de un journal o revista.
- book: Un libro con una editorial explícita. Campos requeridos: author o editor, title, publisher, year.
- booklet: Una obra que está impresa y encuadernada (bound), pero sin una editorial o institución patrocinadora (sponsoring).
- conference: Lo mismo que inproceedings, incluido para compatibilidad con el lenguaje de markup Scribe en:Scribe (markup language).
- inbook: Una parte de un libro, que puede ser un capítulo (o sección o lo que fuere) o un rango de páginas.
- incollection: Una parte de un libro que tiene su propio título.
- inproceedings: Un artículo en las actas de sesiones (proceedings) de una conferencia.
- manual: Documentación técnica.
- mastersthesis: Una tesis de maestría (Master’s en:thesis) o proyecto fin de carrera.
- misc: Para uso cuando los demás tipos no corresponden.
- phdthesis: Una tesis de doctorado (Ph.D. thesis).
- proceedings: Las actas de sesiones (proceedings) de una conferencia.
- techreport: Un reporte publicado por una escuela (school) u otra institución, usualmente numerado dentro de una serie.
- unpublished: Un documento que tiene un autor y título, pero que no fue formalmente publicado.
Creative Commons
Tiene por objetivo traducir las licencias Creative Commons tanto a los diferentes idiomas como a las diferentes legislaciones y sistemas de derechos de autor.
Las licencias Creative Commons o CC están insertas en la licencia GPL (General Public License) , aunque no son un tipo de licenciamiento de software. La idea principal es facilitar la distribución y el uso de contenidos para el dominio público.
Una de las licencias ofrecidas por Creative Commons es la que lleva por nombre “Developing Nations” (Naciones en Desarrollo). Esta licencia permite que los derechos de autor y regalías por el uso da las obras, se cobren sólo en los países desarrollados del primer mundo, mientras que se ofrecen de forma abierta en los países en vías de desarrollo.
Existen otro tipo de liecencias como:
Public Domain (pd) Developing Nations Sampling Founder’s Copyright (fc) CC-GNU GPL