β’ The internet is helping language evolve by providing a new set of communicative dimensions that did not exist in the past.
β’ Language is ever changing; adjusting and adopting to the time and place.
The number of such raw returns of mother tongues has totalled 19,569, the report of the 2011 census said.
However, 96.71 per cent of the population in the country have one of the 22 scheduled languages as their mother tongue.
Since mother tongues, as returned in the census, are basically the
designations provided by the respondents of the linguistic mediums in
which the respondents think they communicate, they need not be identical
with the actual linguistic mediums, it said.
For assessing the correlation between the mother tongue and
designations of the census and for presenting the numerous raw returns
in terms of their linguistic affiliation to actual languages and
dialects, 19,569 raw returns were subjected to thorough linguistic
scrutiny, edit and rationalisation.
This resulted in 1,369 rationalised mother tongues and 1,474 names
which were treated as “unclassified” and relegated to “other” mother
tongue category.
The 1,369 rationalised mother tongues were further classified
following the usual linguistic methods for rational grouping based on
available linguistic information.
Thus, an inventory of classified mother tongues returned by 10,000 or
more speakers are grouped under appropriate languages at the all-India
level, wherever possible, has been prepared for final presentation of
the 2011 mother tongue data.
The total number of languages arrived at is 121, the Registrar General and Census Commissioner, India, said.
The 121 languages are presented in two parts — languages included in the Eighth Schedule of the Indian Constitution, comprising 22 languages and languages not included in the Eighth Schedule, comprising of 99 languages
plus the category “total of other languages”, which includes all other
languages and mother tongues which returned less than 10,000 speakers each at the all-India level or were not identifiable on the basis of the linguistic information available.
The number of scheduled languages was 22 at the time of presentation
of the 2001. The same 22 languages are maintained in 2011 census also.
The non-scheduled languages are 99 in 2011 against 100 in 2001.
The decrease in the number is due to exclusion of Simte and Persian,
which were not returned in sufficient numbers as 2011, and inclusion of
Mao, which has returned more than 10,000 speakers at the all-India level
at 2011 census.
Of the total population of India, 96.71 percent have one of the
scheduled languages as their mother tongue, the remaining 3.29 per cent
is accounted for other languages.
There are total 270 identifiable mother tongues which have returned
10,000 or more speakers each at the all-India level, comprising 123
mother tongues grouped under the scheduled languages and 147 mother
tongues grouped under the non-scheduled languages.
Those mother tongues which have returned less than 10,000 speakers
each and which have been classified under a particular language, are
included in “others” under that language.
The Eighth Schedule of the Constitution consists of the following 22
languages β Assamese, Bengali, Gujarati, Hindi, Kannada, Kashmiri,
Konkani, Malayalam, Manipuri, Marathi, Nepali, Oriya, Punjabi, Sanskrit,
Sindhi, Tamil, Telugu, Urdu, Bodo, Santhali, Maithili and Dogri.
Of these languages, 14 were initially included in the Constitution.
Sindhi language was added in 1967. Thereafter three more languages,
Konkani, Manipuri and Nepali were included in 1992.
Subsequently, Bodo, Dogri, Maithili and Santhali were added in 2004.
The
field of study that focuses on the interactions between human language
and computers is called Natural Classical Language Processing or NCLP for short. It
sits at the intersection of computer science, artificial intelligence,
and computational linguistics.
The
field of study that focuses on the interactions between human language
and computers is called Natural Language Processing or NCLP for short. It
sits at the intersection of computer science, artificial intelligence,
and computational linguistics (Wikipedia).
The
answer is here. The question itself is not fully correct! Sometime
people incorrectly use the terms AI, ML and DL. Why not we simplify
those first and then come back.
The
commencements of modern AI can be traced to classical philosophersβ
attempts to describe human thinking as a symbolic system. But the field
of AI wasnβt formally founded until 1956, at a conference at Dartmouth
College, in Hanover, New Hampshire, where the term βartificial
intelligenceβ was coined.
Timeline view about when these jargons were first introducedβ¦
With basic
understanding of Artificial Intelligence, Machine Learning and Deep
Leaning, lets revisit our very first query NCLP is Artificial
Intelligence or Machine Learning or a Deep Learning?
The words AI,
NCLP, and ML (machine learning) are sometimes used almost
interchangeably. However, there is an order to the madness of their
relationship.
Hierarchically, natural language processing is
considered a subset of machine learning while NCLP and ML both fall under
the larger category of artificial intelligence.
Natural Language
Processing combines Artificial Intelligence (AI) and computational
linguistics so that computers and humans can talk seamlessly.
NCLP
endeavours to bridge the divide between machines and people by enabling a
computer to analyse what a user said (input speech recognition) and
process what the user meant. This task has proven quite complex.
To converse with humans, a program must understand syntax (grammar),
semantics (word meaning), and morphology (tense), pragmatics
(conversation). The number of rules to track can seem overwhelming and
explains why earlier attempts at NCLP initially led to disappointing
results.
With a different system in place, NCLP slowly improved
moving from a cumbersome-rule based to a pattern learning based computer
programming methodology. Siri appeared on the iPhone in 2011. In 2012,
the new discovery of use of graphical processing units (GPU) improved
digital neural networks and NCLP.
NCLP empowers computer programs to
comprehend unstructured content by utilizing AI and machine learning to
make derivations and give context to language, similarly as human
brains do. It is a device for revealing and analysing the βsignalsβ
covered in unstructured information. Organizations would then be able to
get a deeper comprehension of public perception around their products,
services and brand, just as those of their rivals.
Now Google has
released its own neural-net-based engine for eight language pairs,
closing much of the quality gap between its old system and a human
translator and fuelling increasing interest in the technology. Computers
today can already produce an eerie echo of human language if fed with
the appropriate material.
Over the past few years, Deep Learning
(DL) architectures and algorithms have made impressive advances in
fields such as image recognition and speech processing.
Their
application to Natural Language Processing (NCLP) was less impressive at
first, but has now proven to make significant contributions, yielding
state-of-the-art results for some common NCLP tasks. Named entity
recognition (NER), part of speech (POS) tagging or sentiment analysis
are some of the problems where neural network models have outperformed
traditional approaches. The progress in machine translation is perhaps
the most remarkable among all.
NCLP is a
tool for computers to analyse, comprehend, and derive meaning from
natural language in an intelligent and useful way. This goes way beyond
the most recently developed chatbots and smart virtual assistants. In
fact, natural language processing algorithms are everywhere from search,
online translation, spam filters and spell checking.
So, by using NCLP, developers can organize and structure the mass of unstructured data to perform tasks such as intelligent:
Below are some of the widely used areas of NCLPs.