A young woman with a water pitcher

mayo 21, 2010 at 12:39 pm (Joseba Abaitua, Uncategorized)

Outline

  • Painting information
  • Meaning of the painting
  • Details in the painting
  • The face, the girl
  • The wall map
  • The pitcher and basin
  • The window

Painting information

  • This picture of the early to mid-1660s is characteristic of Vermeer’s mature style.
  • Vermeer gets a quiet balance of primary colors and simple shapes
  • The composition suits the theme of domestic, tranquility, underscored by the pitcher, traditional symbol of purity.
  • This canvas was the first of thirteen paintings by Vermeer to enter the United States between 1887 and 1919.

The meaning

  • The difference between outside and inside.
  • The window and the room.
  • The marriage and the freedom.
  • The window -> freedom
  • The room -> marriage

The face

  • She is somewhat inexpressive.
  • A similar white cap in other paintings.
  • The most characteristic -> blue colour of the headdress´ shadows.
  • Neutral gray.    

The wall map

  • The Seventeen Provinces of the Netherlands in 1671.
  • Real map.
  • Vermeer intrduces a few decorative themes to give it a new look.

The pitcher and basin

  • It plays a fundamental role in the composition. Maps were used as a decorative element in Dutch houses.  
  • The realism of both of them in the house life.
  • This model of pitcher could be  owned by Vermeer´s mother in law. A present for her Daugther.    

The window    

  • It seems to be the same as in the “Music Lesson”
  • Some observers believe that the girl is watering the flowers.
  • The glass of the window is broken down to make a mosaic.

Web pages information

Permalink Dejar un comentario

CREA & CORDE

junio 22, 2008 at 11:27 pm (Language resources)

CREA
It is a Corpus of Reference of the current Spanish.It has all the variantions which Spanish can have nowadays.This corpus has a mixture of written and oral texts. These texts are from 1975 until our age.
  1. Its structure of work is the following:
  2. It determines the dimension of the corpus
  3. It can classify texts in a chronological way, spatial, etc.
  4. Acquisition of texts
  5. It can classify texts:
Its works began in 1996.With texts edited between 1975-1999, its first phrase was in December,2000.It is a finished phase but in continuous review.It possesses 130 million forms, so that there were fulfilled the aims(lenses) marked to the beginning of the project. At the end of 2004, it possessed 170 million of forms.
Its materials are selected by a series of parameters:
1. Means :
90 % corresponds(fits) to the written language and 10 %, to the oral language.
Of this 90 %, 49 % are books, other one 49 % is remaining press and 2 % gathers the texts that we name a miscellany: leaflets, prospectos.
2. Chronological:
In periods of five years: 1975-79; 1980-84; 1985-89; 1990-94; 1995-99.
3. Origin:
The texts belong to: 50 % to Spain and other 50 % from Spanish America. 50 % of Spanish America is distributed in linguistic zones according to the number of speakers. The zones are: Andean, Caribbean, central, Chilean, Mexican text.
4. Type of text:
It is formed by three blocks of materials: books and press miscellany transcriptions of spoken language
Books and press.
They were divided in two blocks: fiction and not fiction
These two blocks were divided too in other seven blocks. These blocks are distinguished because of their capacity and the number of forms. They are:
     1. Sciences and technology
     2. Social sciences
     3. Politics, economy, and finance
     4. Arts
     5. Leisure
    6. Wealth
    7. Fiction: novel, statements, theatre
Miscellany
It is divided in two blocks:
   1. Printed
   2. Not printed
Oral corpus
It takes information from television programs or radio programs.
It has one big block which has subspecies:
    1. News
   2 . Reports
   3. Interviews
  4. Debates
  5. Gathering
  6. Documentaries
  7. Sport news
  8. Magazines
  9. Sport magazines
 10. Varieties
  11. Drawings and contests

CORDE
What made that RAE created this corpus, CORDE, was the good results obtained guring the first months of its project.
CORDE consists on something more of 300 million forms which proceeded from texts from the origin of language until 1974.
Bibliography
http://corpus.rae.es/cordenet.html
http://corpus.rae.es/ayuda_c.htm
http://www4.ujaen.es/~ncontrer/interele/links.htm
http://216.239.59.104/search?q=cache:vYr6ppD3loUJ:paginaspersonales.deusto.es/abaitua/konzeptu/ta/soria00.ppt+crea+corde&hl=es&ct=clnk&cd=9&gl=es

Permalink Dejar un comentario

Draft of the final project

junio 22, 2008 at 11:19 pm (Language resources)

My group is the A one and we chose a topic ” Review of Spanish Corpora” to make it.

Firstly, we had to find information about what was a corpus and how the use. With this information we made two parts in the final project: a theoric part, explaining what and how a corpus works and a parctical part, where we have put this information.

  1. Theoric part_ We had writen about the porpuses of a corpus, different corpuses
  2. Practical part_ We explain how the Copus works with the Mark Davies´s Corpus application and with the RAE Corpus.

Permalink Dejar un comentario

Cluvi

junio 22, 2008 at 11:04 pm (Language resources)

The French-Galician CLUVI Dictionary, according to its web page is  ”developed by the SLI (Computational Linguistics Group of the University of Vigo), is a bilingual dictionary based on a representative corpus of French texts with their Galician translation. That parallel corpus is part of the Linguistic Corpus of the University of Vigo (CLUVI). All the French words in the dictionary are found in the CLUVI Corpus, as well as all of the Galician translations. Each of the selected bilingual equivalences is illustrated with a real usage example as found in the CLUVI Corpus. If you want to obtain more examples, please use the CLUVI web interface at http://sli.uvigo.es/CLUVI/. This web application permits both simple and very complex searches of isolated words or sequences of words, and shows the multilingual equivalences of the terms in context, as found in real and referenced translations. Version 0.5 of this dictionay (1173 entries, 2604 translations) contains the French-Galician lexical equivalences with highest frequency in the CLUVI. In future versions, the French-Galician CLUVI Dictionary will be completed with more French entries and more French-Galician equivalences from the CLUVI.”

 Cluvi has its main sections like:

  •  LEGA Corpus of Galician-Spanish legal texts (6.329.655 words) [Search] [Composition]
  • UNESCO Corpus of English-Galician-French-Spanish scientific-technical divulgation (3.724.620 words) [Search] [Composition]
  • LOGALIZA Corpus of English-Galician software localization (2.375.157 words) [Search] [Composition
  • TECTRA Corpus of English-Galician literary texts
  •  LEGE-BI Corpus of Basque-Spanish legal texts
  • TURIGAL Corpus of Portuguese-English tourism texts

Bibliography

http://sli.uvigo.es/CLUVI/corpus.html

http://sli.uvigo.es/CLFG/info.html

http://www.spaniards.es/comment/197694

 

Permalink Dejar un comentario

Types of corpus

junio 20, 2008 at 3:18 pm (Language resources)

There have been authors like J. Sinclair  or J. Torruella and J. Llisterri, who have proposed different classification for different corpuses. For stablishing corpuses, it has to have account for some characteristics:

  • Modality of language
  • The size of texts
  • The age which texts have
  • The treatment applied to the corpus: information…

 Often, these criteria come determined by the purpose that is chased by the corpus:

  • the study of the work of an author
  • the description of a language in general
  • sublanguage or linguistic concrete aspect
  •  the obtaining of a certain commercial product

Bibliography

http://www3.unileon.es/dp/dfh/Milka/LCII/Tipos_corpus.htm

Permalink Dejar un comentario

Definition of corpus

junio 20, 2008 at 3:08 pm (Language resources)

According to Wikipedia, a corpus is:

“In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (now usually electronically stored and processed). They are used to do statistical analysis, checking occurrences or validating linguistic rules on a specific universe.” (Wikipedia)

There are several types of corpus. They are subjected to a process known as annotation

English language:

Other languages:

 Bibliography

http://en.wikipedia.org/wiki/Text_corpus

Permalink Dejar un comentario

European Language Resources Association

febrero 26, 2008 at 10:51 am (Language resources)

The European Language Resources Association (ELRA) was established in Luxembourg in February, 1995. ELRA is the driving force to make available the language resources for language engineering and to evaluate language engineering technologies.
We have to mention what the language resources is. The term language resources refers to a set of speech or language data and descriptions in machine readable form, used e.g. for building, improving or evaluating natural language and speech algorithms or systems, or, as core resources for the software localisation and language services industries, for language studies, electronic publishing, international transactions, subject-area specialists and end users. There are several uses in the Language Resources such as:

  • They can extend their market segments, reaching a wider audience, and making their products more accessible.
  • They can increase cost-effectiveness.
  • They can improve internal and external communication.

These are some of the uses of the Languages Resources.
 One of the mission of ELRA is to promote language resources for the Human Language Technology (HLT) sector; othe mission of this driving force is evaluate language engineering technologies. This two missions can be introduced and described in the “Services around Language Resources” section. In this section we can find several points of analysis:
 - Identification of language resources
  – Promotion of the production of language resources
 - Production of language resources
 - Validation of language resources
 - Evaluation of systems, products, tools, etc., related to language resources
 - Distribution of language resources
 - Standardisation
ELRA also regularly conducts market studies and surveys in the field of HLT, and publishes a quarterly newsletter, distributed not only to its members but also to a large number of people in the HLT community.
ELRA has several services which their actual implementation of these services is undertaken by ELRA’s distribution agency, ELDA in which we can find this services:
● The catalogue of Resources
● Identification and distribution of LRs
● Legal assistance
● Production of new LRs on demand
If someone wants to contact with them or see this page link here.

Bibliography
ELRA home page

Permalink Dejar un comentario

Ontology

enero 19, 2008 at 1:44 pm (Joseba Abaitua, Uncategorized)

In both computer science and information science, an ontology is a data model that represents a set of concepts within a domain and the relationships between those concepts. It is used to reason about the objects within that domain.

Ontologies are used in artificial intelligence, the Semantic Web, software engineering, biomedical informatics, Library Science, and information architecture as a form of knowledge representation about the world or some part of it. Common components of ontologies include:

  • Individuals: the basic or “ground level” objects
  • Classes: sets, collections, or types of objects[1]
  • Attributes: properties, features, characteristics, or parameters that objects can have and share
  • Relations: ways that objects can be related to one another
  • Function terms: complex structures formed from certain relations that can be used in place of an individual term in a statement
  • Restrictions: formally stated descriptions of what must be true in order for some assertion to be accepted as input
  • Rules: statements in the form of an if-then (antecedent-consequent)sentence that describe the logical inferences that can be drawn from an assertion in a particular form
  • Axiom: assertions (including Rules)in a logical form that together comprise the overall theory that the ontology describes, for its domain of application.
  • Events: the changing of attributes or relations

Ontology languages

An ontology language is a formal language used to encode the ontology. There are a number of such languages for ontologies, both proprietary and standards-based:

In order to work with Ontology Languages, there are some useful technologies like Ontology Editor (to create ontologies using one of these languages), Ontology DBMS (to store and to query an ontology) and Ontology Warehouse (to integrate and to explore a set of related ontologies).

Ontology libraries

The development of ontologies for the Web has led to the apparition of services providing lists or directories of ontologies with search facility. Such directories have been called ontology libraries.

The following are static libraries of human-selected ontologies.

The following are both directories and search engines. They include crawlers searching the Web for well-formed ontologies.

  • Swoogle is a directory and search engine for all RDF resources available on the Web, including ontologies.
  • The OntoSelect Ontology Library offers similar services for RDF/S, DAML and OWL ontologies.
  • Ontaria is a “searchable and browsable directory of semantic web data”, with a focus on RDF vocabularies with OWL ontologies.

Bibliography

Permalink Dejar un comentario

BibTex

enero 15, 2008 at 9:34 am (Joseba Abaitua)

Es una herramienta para dar formato a listas de referencias que se utiliza habitualmente con el sistema de preparación de documentos LaTeX.

Información bibliografica

BibTeX usa un formato de archivo basado en texto e independiente del estilo para listas de ítems de bibliografía, como artículos, libros, tesis. Los archivos de bibliografía de BibTeX usualmente terminan en .bib. Los ítems bibliográficos incluidos en un .bib están separados por tipos. Los tipos siguientes son reconocidos por virtualmente todos los estilos de BibTeX:

  • article: Un artículo de un journal o revista.
  • book: Un libro con una editorial explícita. Campos requeridos: author o editor, title, publisher, year.
  • booklet: Una obra que está impresa y encuadernada (bound), pero sin una editorial o institución patrocinadora (sponsoring).
  • conference: Lo mismo que inproceedings, incluido para compatibilidad con el lenguaje de markup Scribe en:Scribe (markup language).
  • inbook: Una parte de un libro, que puede ser un capítulo (o sección o lo que fuere) o un rango de páginas.
  • incollection: Una parte de un libro que tiene su propio título.
  • inproceedings: Un artículo en las actas de sesiones (proceedings) de una conferencia.
  • manual: Documentación técnica.
  • mastersthesis: Una tesis de maestría (Master’s en:thesis) o proyecto fin de carrera.
  • misc: Para uso cuando los demás tipos no corresponden.
  • phdthesis: Una tesis de doctorado (Ph.D. thesis).
  • proceedings: Las actas de sesiones (proceedings) de una conferencia.
  • techreport: Un reporte publicado por una escuela (school) u otra institución, usualmente numerado dentro de una serie.
  • unpublished: Un documento que tiene un autor y título, pero que no fue formalmente publicado.

http://es.wikipedia.org/wiki/BibTeX

Permalink Dejar un comentario

Creative Commons

enero 15, 2008 at 9:25 am (Joseba Abaitua)

Tiene por objetivo traducir las licencias Creative Commons tanto a los diferentes idiomas como a las diferentes legislaciones y sistemas de derechos de autor.
Las licencias Creative Commons o CC están insertas en la licencia GPL (General Public License) , aunque no son un tipo de licenciamiento de software. La idea principal es facilitar la distribución y el uso de contenidos para el dominio público.
Una de las licencias ofrecidas por Creative Commons es la que lleva por nombre “Developing Nations” (Naciones en Desarrollo). Esta licencia permite que los derechos de autor y regalías por el uso da las obras, se cobren sólo en los países desarrollados del primer mundo, mientras que se ofrecen de forma abierta en los países en vías de desarrollo.

Existen otro tipo de liecencias como:

  • Public Domain (pd)
  • Developing Nations
  • Sampling
  • Founder’s Copyright (fc)
  • CC-GNU GPL
  • http://es.wikipedia.org/wiki/Creative_Commons

    Permalink Dejar un comentario

    Página siguiente »

    Seguir

    Get every new post delivered to your Inbox.