Dualism

April 4th, 2010

I would like to turn for a moment to the discussion of duality. Commonly when one speaks of a duality in reference to the mind body problem they are speaking of the existence of a physical substance and a non-physical mind substance. However in my recent contemplations I have narrowed the search for the dualistic properties of the mind down to the interaction between a physical substance, matter, and a non-substance, which is simple the substance of meaning and logic. It seems to me that these two things, while not commonly set in contrast to each other may play the roles of the physical substance and mind substance that are postulated in the dualistic approach.

What I mean in regards to matter is simply the material of which physicists are investigators of. Matter here is simply the collection of particles and energy in the universe. Particles which jostle around, perhaps clump together, but have no meaning or purpose. That is from a truly physical perspective you are just a mass of molecules, most of them water molecules jiggling around in a large mass. This mass of particles may have physical properties such as mass, and velocity, but without a definition of a boundary between the particles that form you and those that are not, such measurements are not feasible.

Now, in addition to your physical substance you are composed also of meaning. This meaning is what your identity is composed of. Without meaning, the mass of particles that compose your top half may as well be part of a different subject than those that form your bottom half. Meaning has a substance all it's own. The organization of this substance is studied by ontologists. It is the material of mathematics. It is what this blog post is made out of. Ideas, meanings, definitions all belong to this second class of substance.

It may be difficult to separate these to substances and perceive them as being altogether different. This is because our minds are on the threshold between these two worlds. One world composed of only substance, and the other of only meaning. These two universes are molded together in that one can be mapped onto the other. Matter can take on forms, spheres, people. Whereas some parts of the physical world resist definition. Where is the boundary of the universe and what lies beyond it, for example? Whereas there are some aspects of meaning that cannot be represented by matter, like a perfect circle. So where these worlds do unite and in as much as they are capable of overlapping is where we as conscious beings exist.

What’s the missing link in neural network theory?

June 6th, 2009

A neural network is a mathematical model that is inspired by the function of neurons in brains. These networks of virtual neurons have been successfully used to solve a large set of computational challenges. They have been used in pattern classification as well as for prediction and forecasting. Despite the success in certain applications, neural networks have not achieved the results that were anticipated decades ago in the excitement that prevailed when they were just beginning to show results. But as the years go by it seems that progress has not moved as fast as expected. Many researchers have deserted the neural network approach for a more computationally friendly and mathematically elegant approach, namely support vector machines.

So what happened? If biological neurons have the ability to solve some of the most complex problems, reasoning and vision for example, why can’t this success be duplicated with virtual neurons? Currently research efforts towards this end are focused on brain mapping and imaging. The goal is to obtain a massively high resolution map of the physical, chemical and electrical properties the brain. Presumably this data will be the key to unlock the secrets of the mind and will lead to advances in neural computation approaches.  

Perhaps we have all been looking in the wrong place. Perhaps the real problem is not lack of information but rather false assumptions that have been made about the purpose and role that neurons play in biological systems. Is it possible that machine learning researches simply don’t understand the function of neurons in our brains? Let me use an analogy. It would be like trying to reveal the source code of Microsoft Office by placing your hard disk under a microscope. Trying to decode the secrets of a computer program would be very difficult without knowledge of a computer language. So maybe there is some underlying misunderstanding of what the brain is actually doing and which functions it is performing. The distinction between the physical and the logical needs to be better understood.

Now the particular assumption that I am referring to is one that is made as a result of mathematical thinking. The assumption is about the function of artificial neural networks as they have been adopted by computer scientist. An example of neural network application is to take as input some set of values that represent the properties of an object, perform some computation and to output a number that correctly classifies the input. This idea that neurons are just fancy calculators used to compute the solution to a given problem is a false assumption. In reality neurons do not simply crunch information to produce the knowledge needed to take some action. In reality neurons go beyond this by complete the cycle by actually taking action.

The missing half of the neural network problem is action. Your neurons not only can identify that your hand is on a hot stove, it goes ahead and moves your hand away, and that is even before being processed by the cortex. Behavior is just as fundamental to biological neurons as observation.  Ever since the neuron was invented more than 500 million years ago according to evolutionary theory they have been used to enable movement and action. Almost all networks of neurons in organisms have a cycle that binds observation to behavior in a feedback loop.

Behavior Observation Cycle

This looping process incorporates a space in which a body can sense and take action within. But despite the fact that it would be hard to find an example of a real neural network that does not incorporate this process, it has been completely ignored by computer scientists in favor of a more utilitarian model that simply makes information available to a foreign observer. Perhaps developing a system to do whatever is wants to do does not seem to be the most useful thing to do. How can anyone get ahead by developing software that doesn’t provide useful information but rather it interacts with the universe and just does whatever it wants? The assumption that a neural network should be useful solely to provide knowledge is misleading. What is needed is to research the emergent properties of the behavior, observation cycle that is so fundamental to life.

The Mind and Machine Learning

May 28th, 2009

Focusing on the virtue of elegant mathematical models will surely mislead any inquirer of the nature of the mind. Most of the modern mathematical language is directly sourced to Sir Isaac Newton and his theories of the physical world. Simply because a logical relationship or a functions of variables can be written with cool looking Greek symbols and integrals does not mean it is the most usefull representation. A universally held assumption, held by those in the machine learning community, is that object labels or classifications are functions of physical quantities, namely vectors. And that they behave like physical phenomenon. But the classification of something is surely in the eye of the beholder and therefore it belongs to the domain of the mind.

Sobolev spaces

Hilbert dimension

Hilbert dimension

Ever since Galileo and his telescope overturned the spiritual worldview and gave birth to the scientific or physicalistic perspective, scientists have generally believed that all things reduce to the physical. Only a small group of scientists, mostly philosophical or religious types, prescribed to the notion of a fundamental dualism of mind and body.

 

Galileo Galilei

Galileo Galilei

 

The idea that there is a substance that is altogether different from anything that can be found in the physical world, that there exists a mind separate from the body was expressed well by Rene Descartes in his Meditations on First Philosophy:

Moreover, I find in myself diverse faculties of thinking that have each their special mode: for example, I find I possess the faculties of imagining and perceiving, without which I can indeed clearly and distinctly conceive myself as entire, but I cannot reciprocally conceive them without conceiving myself, that is to say, without an intelligent substance in which they reside, for [in the notion we have of them, or to use the terms of the schools] in their formal concept, they comprise some sort of intellection; whence I perceive that they are distinct from myself as modes are from things. I remark likewise certain other faculties, as the power of changing place, of assuming diverse figures, and the like, that cannot be conceived and cannot therefore exist, any more than the preceding, apart from a substance in which they inhere. It is very evident, however, that these faculties, if they really exist, must belong to some corporeal or extended substance, since in their clear and distinct concept there is contained some sort of extension, but no intellection at all. Further, I cannot doubt but that there is in me a certain passive faculty of perception, that is, of receiving and taking knowledge of the ideas of sensible things; but this would be useless to me, if there did not also exist in me, or in some other thing, another active faculty capable of forming and producing those ideas. But this active faculty cannot be in me [in as far as I am but a thinking thing], seeing that it does not presuppose thought, and also that those ideas are frequently produced in my mind without my contributing to it in any way, and even frequently contrary to my will. This faculty must therefore exist in some substance different from me, in which all the objective reality of the ideas that are produced by this faculty is contained formally or eminently, as I before remarked; and this substance is either a body, that is to say, a corporeal nature in which is contained formally [and in effect] all that is objectively [and by representation] in those ideas; or it is God himself, or some other creature, of a rank superior to body, in which the same is contained eminently. But as God is no deceiver, it is manifest that he does not of himself and immediately communicate those ideas to me, nor even by the intervention of any creature in which their objective reality is not formally, but only eminently, contained. For as he has given me no faculty whereby I can discover this to be the case, but, on the contrary, a very strong inclination to believe that those ideas arise from corporeal objects, I do not see how he could be vindicated from the charge of deceit, if in truth they proceeded from any other source, or were produced by other causes than corporeal things: and accordingly it must be concluded, that corporeal objects exist. Nevertheless, they are not perhaps exactly such as we perceive by the senses, for their comprehension by the senses is, in many instances, very obscure and confused; but it is at least necessary to admit that all which I clearly and distinctly conceive as in them, that is, generally speaking all that is comprehended in the object of speculative geometry, really exists external to me. 

The above reasoning does not appeal to our modern day machine learning scientists. The distinction between the physical laws of Newton and the functions of the mind needs to be drawn before progress will be made in the field of machine learning.

 

René Descartes

René Descartes

 

Drawing from Meditations on First Philosophy

Drawing from Meditations on First Philosophy

Jeff Hawkins

May 28th, 2009
Jeff Hawkins is well known for being the inventor of the Palm computer, but what is not as well known is that he is also an avid researcher of neuroscience and machine learning. His primary focus as a theoretical neuroscientist is the function of the neo cortex, the region of the brain responsible for advanced reasoning. His work in machine learning is described in his book On Intelligence, which is widely available in book stores and via audible.com.

On Intelligence - Jeff Hawkins

On Intelligence - Jeff Hawkins

His focus on studying the function of the brain brings an outsiders perspective to machine learning. He has developed a new machine learning framework called Hierarchal Temporal Models. What makes this approach different is that it is based on discoveries made about the organization of the neo cortex, and specifically how models of patterns are stored hierarchaly. His company, Numenta, Inc. currently owns the patents for this technology. According the Numenta website the name Hierarchical Temporal Memory is defined as follows:

Hierarchical -- HTMs are organized as a tree-shaped hierarchy of nodes. Each node implements a learning and memory function, that is, it encapsulates an algorithm. Lower-level nodes receive large amounts of input and send processed input up to the next level. In that way, the HTM Network abstracts the information as it is passed up the hierarchy.

Temporal -- During training, the HTM application must be presented with objects as they change over time. For example, during training of the Pictures application, the images are presented first top to bottom, then left to right as if the image were moving over time. Note that the temporal element is critical: The algorithm has been written to expect input that changes gradually over time.

Memory -- An HTM application works in two stages, which can be thought of as training memory and using memory. During training, the HTM Network learns to recognize patterns in the input it receives. Each level in the hierarchy is trained separately. In the fully trained HTM Network, each level in the hierarchy knows -- has in memory -- all the objects in its world. During inference, when the HTM Network is presented with new objects, it can determine the likelihood that an object is one of the already known objects.

What is great about Mr. Hawkins style and method is that he is more focused on practical functions such as the actual function of the brain and less on abstract mathematical concepts.

Machine Learning Market Research Report

May 26th, 2009

Today I have compiled a market research report of publicly traded companies here in the United Sates that have significant intellectual property in machine learning. Of course there are additional firms that have made advances in machine learning. Some of the machine learning companies that did not make it into this report because they are either private companies such as Numenta, Inc. or foreign such as NEC Corp. will be discussed in later posts.

Also, I have installed a stock ticker on the right to track the progress of these machine learning innovators in the marketplace. Each machine learning firm detailed in this report has received five or more patents for machine learning technologies. Some of the relevant patents are listed. Also an excerpt from the companies website pertaining to machine learning technology has been included as well as a direct link to the referenced material.

 

Symbol
Company

 

Lucent Technologies Inc.

 

From www.alcatel-lucent.com: "Improving the intelligence of mobile communication networks at the architecture level calls of innovative new thinking. In today’s networks, delivering true service personalization would require hundreds or thousands of mini base stations and cellular transmitters meshed into a single seamless network. With existing network architectures, cost and space constraints make this kind of configuration impractical and the manual, labor-intensive process of configuring and managing this sot of network is impractical. This is only accomplished through automating the entire process using advances in machine learning and self optimization/network maintenance techniques."


Patents from www.google.com/patents

Apparatus and methods for machine learning hypotheses

US Pat. 5819247 - Filed Jul 29, 1997 - Lucent Technologies, Inc.

Primary Examiner — Allen R. MacDonald Assistant Examiner — Jeffrey S. Smith [57]

ABSTRACT Apparatus and methods lor machine learning the hypotheses used ...

Method and apparatus for determining the accuracy limit of a learning ...

US Pat. 5720003 - Filed Feb 17, 1995 - Lucent Technologies Inc.

Chiang, W.-P., Cortes, C, Jackel, LD, LeCun, Y., and Lee, W., "Accuracy Limits

of Machine Learning: Predicting Communication Path Degradation or Failure," ...

Rule induction on large noisy data sets

US Pat. 5719692 - Filed Jul 7, 1995 - Lucent Technologies Inc.

P. Clark, R. Boswell, "Rule Induction with CN2 Some Recent Improvements",

Machine Learning — Proceedings of the Fifth European Conf. ...

Method and apparatus for determining the limit on learning machine accuracy ...

US Pat. 5684929 - Filed Oct 27, 1994 - Lucent Technologies Inc.

WP Chiang, C. Cortes, LD Jackel, Y. LeCun, W. Lee, Accuracy Limits of Machine

Learning: Predicting Communication Path Degradation or Failure, presented at ...

Method and apparatus for improving the efficiency of support vector machines

US Pat. 6134344 - Filed Jun 26, 1997 - Lucent Technologies Inc.

382/ OTHER PUBLICATIONS CJ Burges "Simplified Support Vector Decision Rules"

Machine Learning, Proceedings ol the Thirteenth International Conference (ICML

 

Cadence Design Systems, Inc.

 

From www.cadence.com: "Cadence Research Laboratories (CRL) was established in 1993 in Berkeley to focus on advanced research in Electronic Design Automation. We study the mathematical and algorithmic foundations for modeling, analyzing, and optimizing complex integrated systems. Our mission is broadly stated—Increase external visibility of Cadence as a technology leader, and help make its products successful in the marketplace. We are actively engaged with Cadence's business units. Our contributions include developing prototype components for proving new concepts, consulting on operational and strategic decisions, generating code for products, and directing customer engagements to discover new challenges and receive feedback on new ideas. We are also involved in the external research community through regular publications of scientific papers and active leadership on conference committees and in other organizations. CRL maintains a variety of collaborations with other research institutions and universities; we are particularly involved with the University of California at Berkeley."


Patents from www.google.com/patents

System and method for generating and using stage-based constraints for ...

US Pat. 6263478 - Filed Aug 11, 1998 - Cadence Design Systems, Inc.

88-91, IEEE International Conference on Com- puter-Aided Design ICCAD-87, Nov. 9

-,, Santa Clara, California. Cadence Design Systems, Inc., ...

Hierarchical, rules-based, general property visualization and editing method ...

US Pat. 7367006 - Filed Jan 11, 2005 - Cadence Design Systems, Inc.

Cadence Design Systems, Inc. "Virtuoso® Spectre® Circuit Simulator User Guide"

Product ... Cadence Design Systems, Inc. "Simulating Complex RF/Mixed- Signal

...

Mixed signal synthesis behavioral models and use in circuit design optimization

US Pat. 6637018 - Filed Oct 26, 2000 - Cadence Design Systems, Inc.

Antrim Design Systems, Inc., "The Characterization and Behavioral Model

Generation ol Analog Intellectual Property," Antrim Design Systems White Paper,

May, ...

Block based design methodology

US Pat. 6631470 - Filed Mar 23, 2001 - Cadence Design Systems, Inc.

"Block-Based Design Methodology Documentation," by Cadence Design Systems, Inc.,

Services Reseach & Development, Project Alba, Version, May 21,. ...

Identifying overconstraints using port abstraction graphs

US Pat. 5568396 - Filed Jan 21, 1994 - Cadence Design Systems, Inc.

[73] Assignee: Cadence Design Systems, Inc., San Jose, Calif. [21] Appl. No.: [

22] Filed: Jan. 21, [51] Int. CI.6 G06F/50 [52] US Cl 364/491; 364/488; ...

 

Google, Inc.

 

From www.google.com: "Google New York is the place to be for software engineers who want to push the limits on distributed systems, scale and performance. Here, we work on projects of massive scale, develop and build great systems, and produce quick and accurate solutions to challenging problems. At all times, we match projects with our employees' background and interests. Google engineers are experts in C++, Java, AJAX and HTML and focus on these key areas: Search and advertising quality We parse user queries, select and rank matching pages, summarize results for the user, and match ads to queries and Web pages. We immerse ourselves in various technologies, including artificial intelligence (AI), data mining, machine learning, natural language processing, and information retrieval and extraction."


Patents from www.google.com/patents

Large scale machine learning systems and methods

US Pat. 7222127 - Filed Dec 15, 2003 - Google Inc.

Rule-based Machine Learning Methods for Functional Prediction, Journal oif Al

Research, vol. 3, Dec., pp. 383- 403.* (Continued) Primary Examiner — Greta ...

System and methods for automatically creating lists

US Pat. 7350187 - Filed Apr 30, 2003 - Google Inc.

... "Employing EM and Pool-Rased Active Learning for Text Classification;"

Proceedings of the Fifteenth International Conference on Machine Learning;; ...

Methods and systems for identifying manipulated articles

US Pat. 7302645 - Filed Dec 10, 2003 - Google Inc.

Alternatively, a machine learning approach can be used to define the rules. With

the machine learning approach, a set of clusters, know as a training set, ...

System and method for supporting editorial opinion in the ranking of search ...

US Pat. 7096214 - Filed Dec 13, 2000 - Google Inc.

Koller, D. and M. Sahami, "Hierarchically Classifying Documents Using Very Few

Words," International Conference on Machine Learning,, pp.-. ...

Systems and methods for performing in-context searching

US Pat. 7305380 - Filed Dec 13, 2000 - Google Inc.

705/26 OTHER PUBLICATIONS D. Koller et al., "Hierarchically Classifying

Documents Using Very Few Words," International Conference on Machine Learning,,

pp. ...

 

Health Discovery Corp. 

 

From www.healthdiscoverycorp.com: "Health Discovery Corporation's SVM technology outperforms even advanced statistical modeling methodologies such as neural networks. Neural networks suffer from a limited ability to handle data and can only analyze the data from two or three dimensions. Support Vector Machines, however, are able to process infinite amounts of data and to analyze the data to find separations and delineations high dimensionality. The Company’s SVM technology is commonly considered within the context of artificial intelligence. This is a branch of computer science concerned with giving computers the ability to perform functions normally associated with human intelligence, such as reasoning and optimization through experience. Machine learning is a type of artificial intelligence that enables the development of algorithms and techniques that allow computers to learn. Pattern recognition is machine learning with a wide spectrum of applications including medical diagnosis, bioinformatics, classifying DNA sequences, detecting credit card fraud, stock market analysis, object recognition in computer vision, and robot locomotion."


Patents from www.google.com/patents

Methods for feature selection in a learning machine

US Pat. 7318051 - Filed May 20, 2002 - Health Discovery Corporation

"Tumour class prediction and discovery by microarray-based DNA methylation

analysis" ... of the Sixteenth International Conference on Machine Learning, Jun

. ...

Spectral kernels for learning machines

US Pat. 6944602 - Filed Mar 1, 2002 - Health Discovery Corporation

... Berkeley, CA (US) (73) Assignee: Health Discovery Corporation, Savannah, ...

and spectral graph theory lor solving problems ol machine learning. ...

Methods of identifying patterns in biological systems and uses thereof

US Pat. 7117188 - Filed Jan 24, 2002 - Health Discovery Corporation

Knowledge discovery is the most desirable end product of data collection. ... In

recent years, machine-learning approaches for data 30 analysis have been ...

Computer-aided image analysis

US Pat. 6996549 - Filed Jan 23, 2002 - Health Discovery Corporation

In recent years, machine-learning approaches for image analysis have been widely

... a case that can easily obscure the optimal solution from discovery. ...

Kernels and methods for selecting kernels for use in learning machines

US Pat. 7353215 - Filed May 7, 2002 - Health Discovery Corporation

Niyogi, P., "Incorporation Prior Information in Machine Learning by Creating ...

First International Conference on knowledge Discovery & Data Mining, 1995, ...

 

Honeywell International Inc.

 

From www.honeywell.com: "HTSL’s Research group, works with Honeywell product R&D groups and Honeywell Research Labs globally develops key technologies that enhance Honeywell products to benefit its customers. It also collaborates with premier universities and other research institutes regionally and globally. The computation and communication systems (CCS) lab focuses on core research in the fields of computation systems with a primary focus on wireless system design. The CCS lab has core competencies in research pertaining to algorithm design, analysis and parallelization, large scale complex system design and implementation, model based development, modeling and simulation of networks and performance analysis, etc,. The Intelligence Vehicle Technologies group works on autonomous vehicle systems for various applications like Unmanned Air Vehicles, Mining, Agriculture, Smart transportation systems, Robot vehicles, Marine vessels, etc,. The focus areas include navigation, perception sensing, guidance, x-by-wire controls, multi-sensor fusion, fault tolerance and reconfiguration, smart communication and computing platforms with competencies ranging from multi-sensor fusion algorithms, Pattern Recognition, Machine Learning to Vision sensor processing techniques, etc,.


Patents from www.google.com/patents

Distribution theory based enrichment of sparse data for machine learning

US Pat. 7127435 - Filed Jul 3, 2001 - Honeywell International Inc.

706/20 706/ A technique for enriching sparse data for machine learning

techniques such as supervised artificial neural network includes receiving the

sparse ...

Combinatorial approach for supervised neural network learning

US Pat. 6954744 - Filed Aug 29, 2001 - Honeywell International, Inc.

The technique also includes performing supervised machine learning using the

specified neural network architecture, initialized weights, and the read data

...

Genetic algorithm synthesis of neural networks

US Pat. 5140530 - Filed Mar 28, 1989 - Honeywell Inc.

Proceedings of the 5th International Conference on Machine Learning, Ann Arbor,

Mich. pp. 153-161. Carpenter, GA, and S. Grossberg. (1988). ...

Adaptive knowledge management system for vehicle trend monitoring, health ...

US Pat. 6907416 - Filed Jun 4, 2001 - Honeywell International Inc.

It uses a plurality of Analytical and Machine Learning tools for capturing

knowledge from data sources and populating cells of the Structured Knowledge ...

Neuro/fuzzy hybrid approach to clustering data

US Pat. 6904420 - Filed May 17, 2001 - Honeywell International Inc.

... pattern prediction, and supervised machine learning systems. 24. The

computer readable medium of claim 21, further comprising: if the received data

is ...

 

International Business Machines Corporation

 

From ibm.com: "The Machine Learning Group specializes in developing algorithms for automatic pattern recognition, prediction, analysis, classification, and learning of structures. We supply both core technologies and machine learning services. Our core technologies include: Bayesian networks Learning and classifying structures Anomaly detection Feature selection Time series analysis Support Vector Machines (SVM)


Patents from www.google.com/patents

Machine learning based electronic messaging system

US Pat. 6424997 - Filed Jan 27, 1999 - International Business Machines Corporation

23, 2002 (54) MACHINE LEARNING BASED ELECTRONIC MESSAGING SYSTEM (75) Inventors:

... MD (US) (73) Assignee: International Business Machines Corporation, ...

Model selection in machine learning with applications to document clustering

US Pat. 6584456 - Filed Jun 19, 2000 - International Business Machines Corporation

24, 2003 (54) MODEL SELECTION IN MACHINE LEARNING WITH APPLICATIONS TO DOCUMENT

... CA (US) (73) Assignee: International Business Machines Corporation, ...

Learning system with prototype replacement

US Pat. 5649070 - Filed Feb 17, 1995 - International Business Machines Corporation

... NY [73] Assignee: International Business Machines Corporation, Armonk, ...

RS Michalski et al, "NSF/DARPA Workshop on Machine Learning and Vision: A ...

Scalable set oriented classifier

US Pat. 5899992 - Filed Feb 14, 1997 - International Business Machines Corporation

[73] Assignee: International Business Machines Corporation, Armonk, ... Quinlan,

JR, "Combining Instance-Based and Model- Based Learning", Machine Learning, ...

Method and apparatus for generating a data classification model using ...

US Pat. 6728689 - Filed Nov 14, 2000 - International Business Machines Corporation

... Stamford, CT (US) (73) Assignee: International Business Machines Corporation

, ... ence on Machine Learning ().* Thrun et al., "Learning One More Thing," ...

 

Microsoft Corporation

 

From microsoft.com: "The Machine Learning and Applied Statistics (MLAS) group is focused on learning from data and data mining. By building software that automatically learns from data, we enable applications that (1) do intelligent tasks such as handwriting recognition and natural-language processing, and (2) help human data analysts more easily explore and better understand their data. We strive to advance the state of the art in machine learning and statistics, develop fast scalable algorithms for learning and mining, implement portions our work toolkits, and apply our work to numerous product applications."


Patents from www.google.com/patents

Method for boosting the performance of machine-learning classifiers

US Pat. 7024033 - Filed Mar 4, 2002 - Microsoft Corp.

International Conference on Machine Learning, pp. 322-330. Morgan Kaufmann,.

Schapire, RE and Y. Singer. Improved boosting algorithms using confidence-rated

Training machine learning by sequential conditional generalized iterative ...

US Pat. 7107207 - Filed Jun 19, 2002 - Microsoft Corporation

... ABSTRACT A system and method facilitating training machine learning systems

utilizing sequential conditional generalized iterative scaling is provided. ...

Technique which utilizes a probabilistic classifier to detect "junk" e-mail ...

US Pat. 6161130 - Filed Jun 23, 1998 - Microsoft Corporation

William W. Cohen, "Learning Rules that Classify E-Mail", In the Proceedings of

the 1996 AAAI Spring Symposium on Machine Learning in Information Access. ...

Layered models for context awareness

US Pat. 7203635 - Filed Jun 27, 2002 - Microsoft Corporation

T. Joachims, Text categorization with support vector machines: learning with

many relevant features, Machine Learning, European Conference on Machine ...

Method and apparatus for fast machine training

US Pat. 6697769 - Filed Jan 21, 2000 - Microsoft Corporation

Machine learning is a general term that describes auto- 10 matically setting the

... One common use for machine learning is the training of parameters for a ...

 

Koninklijke Philips Electronics N.V

 

From philips.com: "Philips’ brand positioning Sense and Simplicity underlines its aim of delivering advanced products and applications that are meaningful and easy-to-use. To do so, we need to fully understand how people experience technology. As all experiences in a person’s life are made up of a combination of seeing, hearing, feeling, tasting and smelling, studying these basic modalities leads to fundamental insights that can be applied in human-centric design. Human perception is an important research field that has already led to many innovative products and applications.


Patents from www.google.com/patents

Method and apparatus for partitioning a plurality of items into groups of ...

US Pat. 6801917 - Filed Nov 13, 2001 - Koninklijke Philips Electronics N.V.

... Yorktown Heights, NY (US) (73) Assignee: Koninklijke Philips Electronics NV,

... for Learning With Symbolic Features" Machine Learning, vol.,, pp. ...

Electronic program guide viewing history generator method and system

US Pat. 6934964 - Filed Feb 8, 2000 - Koninklijke Philips Electronics N.V.

For example, current systems such as Philips® unreliable pattern or rule (ie,

... 2J considerations and the machine-learning method being used, ...

Search user interface providing mechanism for manipulation of explicit and ...

US Pat. 6662177 - Filed Mar 29, 2000 - Koninklijke Philips Electronics N.V.

A machine-learning algorithm is used to derive a model by which user-preferences

can be predicted. Explicit profiles are rules entered by the user. ...

User interface/entertainment device that simulates personal interaction and ...

US Pat. 6795808 - Filed Oct 30, 2000 - Koninklijke Philips Electronics N.V.

It could look for cues from the particular user indicating that the end of his/

her response has been reached by feeding, to an internal machine-learning ...

Environment-responsive user interface/entertainment device that simulates ...

US Pat. 6721706 - Filed Oct 30, 2000 - Koninklijke Philips Electronics N.V.

It could look for cues from the particular user indicating that the end of his/

her response has been reached by feeding, to an internal machine-learning ...

 

 

Science Applications International Corporation

 

From saic.com: "Discovering Hidden PatternsSAIC's Executive Science and Technology Council (ESTC) promotes high-quality technical work by presenting yearly awards for papers published in peer-reviewed journals. A recent EST Award winner detailed a breakthrough in working with huge sets of data. From medical imaging to optimizing oil wells, many industries produce increasingly huge volumes of data from their high-tech applications. To better organize and analyze the enormous sets of data from such applications, researchers use algorithms that cluster data — break it into smaller, related groups. But data clustering — which plays a central role in data mining techniques and classifying retrieved Web pages — does not always reveal links between seemingly unrelated information.


Patents from www.google.com/patents

Machine learning of document templates for data extraction

US Pat. 7149347 - Filed Mar 2, 2000 - Science Applications International Corporation

... Assignee: Science Applications International Corporation, San Diego, ... The

present system can perform machine learning of prototypical descriptions of ...

Ontology-based parser for natural language processing

US Pat. 7027974 - Filed Oct 27, 2000 - Science Applications International Corporation

Building Domain-Specific Search Engines with Machine Learning Techniques. ...

variety of applications, including search engines, summarization applications,

...

Method and system of ranking and clustering for document indexing and retrieval

US Pat. 6766316 - Filed Jan 18, 2001 - Science Applications International Corporation

... Carlsbad, CA (US) (73) Assignee: Science Applications International

Corporation, ... Ltd. Tom M. Mitchell, "Machine Learning", WCB/McGraw-Hill. ...

Systems and methods for monitoring health and delivering drugs transdermally

US Pat. 6887202 - Filed May 30, 2001 - Science Applications International Corporation, Georgetown, University

Machine-learning algorithms are preferably used to 20 acquire a ... can be

easily adapted for many innovative applications by applying new chemistries for

...

Method and system for extracting information from a document

US Pat. 7142728 - Filed May 17, 2002 - Science Applications International Corporation

... CROSS REFERENCE TO RELATED APPLICATIONS 5 The present application hereby ...

"Machine Learning of Document Templates for Data Extraction," filed Mar. ...

 

Silicon Graphics, Inc.

 

From sgi.com: "MLC++ provides general machine learning algorithms that can be used by end users, analysts, professionals, and researchers. The main objective is to provide users with a wide variety of tools that can help mine data, accelerate development of new mining algorithms, increase software reliability, provide comparison tools, and display information visually. More than just a collection of existing algorithms, MLC++ is an attempt to extract commonalities of machine learning algorithms and decompose them for a unified view that is simple, coherent, and extensible.


Patents from www.google.com/patents




System and method for selection of important attributes

US Pat. 6026399 - Filed May 30, 1997 - Silicon Graphics, Inc.

Quinlan, JR, C4.5: Programs for Machine Learning, Morgan Kaufmann Publishers,

Inc., pp. 17-26 (1993). Quinlan, JR, "Induction of Decision Trees," Machine ...

Method, system, and computer program product for visualizing a decision-tree ...

US Pat. 6278464 - Filed Mar 7, 1997 - Silicon Graphics, Inc.

Quinlan, JR, C4.5: Programs for Machine Learning, Morgan Kaufmann Publishers,

Inc., pp. 17-26 (1993). Quinlan, JR, "Induction of Decision Trees," Machine ...

Computer-related method, system, and program product for controlling data ...

US Pat. 6480194 - Filed Nov 12, 1996 - Silicon Graphics, Inc.

Kohavi, R., "Scaling Up the Accuracy of Naive-Bayes Classifiers: a Decision-Tree

Hybrid," In Data Mining and Visualization, Silicon Graphics, Inc., ...

Method, system, and computer program product for visualizing a data structure

US Pat. 6301579 - Filed Oct 20, 1998 - Silicon Graphics, Inc.

5 : Programs for Machine Learning, Morgan Kaufmann Publishers, Inc., pp. ...

Rendering Algorithm for Visualizing 3D Scalar Fields," Computer Graphics, vol.

...

Method, system, and computer program product for mapping between an overview ...

US Pat. 6259451 - Filed Apr 25, 1997 - Silicon Graphics, Inc.

Quinlan, JR, C4.5: Programs for Machine Learning, Morgan Kaufmann Publishers,

Inc., pp. 17-26 (1993). Quinlan, JR, "Induction of Decision Trees," Machine ...

 

 

AT&T Corp.

 

From att.com: "Machine Learning describes a set of highly generic tools that enable learning from examples, and alleviate the need for hand-crafted expert systems. AT&T-pioneered ML solutions rely on sophisticated mathematical optimization techniques and scale well to large problems. Automates speech-enabled services creation and maintenance. Creates accurate speech models from examples. Reduces reliance on expert knowledge. Enables automatic reporting and speech mining. Supports scalability efforts. Allows zero-touch service improvements (reducing defects). Supports customer-centric performance measures (e.g., call classification accuracy). Key differentiator in AT&T VoiceTone® offers. 60% reduction of manual labeling efforts. Automatic creation of data mining reports. Automatic customer call-flow analysis, problem pinpointing, SLA verification. Business intelligence extraction. Strong emphasis on reusable software libraries: techniques easy to transfer to IP traffic measurement and monitoring applications.


Patents from www.google.com/patents

System and method for finding information in a distributed information ...

US Pat. 6418432 - Filed Jul 24, 1998 - AT&T Corporation

In Machine Learning: Proceedings of the Twelfth International Conference, Lake

Taho, ... C4,5: programs for machine learning. Morgan Kaufmann, 1994. ...

Biased learning system

US Pat. 5481650 - Filed Oct 7, 1994 - AT&T Corp.

... NJ [73] Assignee: AT&T Corp., Murray Hill, NJ [21] Appl. No. ... to Learning

Relational Concepts", Proc of 8th Int'l Workshop on Machine Learning,. ...

Method of active learning for automatic speech recognition

US Pat. 7149687 - Filed Dec 24, 2002 - AT&T Corp.

... NJ (US); Giuseppe Riccardi, Hoboken, NJ (US) (73) Assignee: AT&T Corp., New

York, ... in Proc. of the Inernational Conference on Machine Learning,, pp. ...

Module for constructing trainable modular network in which each module ...

US Pat. 6128606 - Filed Mar 11, 1997 - AT&T Corporation

Primary Examiner — -Tariq R. Hafiz Assistant Examiner — -Jason W. Rhodes

Attorney, Agent, or Firm — Kenyon & Kenyon [57] ABSTRACT A machine learning ...

Automatic temporospatial pattern analysis and prediction in a ...

US Pat. 5724263 - Filed May 30, 1997 - AT&T Corp

... Highland Park, all of NJ [73] Assignee: AT&T Corp, Middletown, NJ [21] Appl.

No. ... "Case-Based Reasoning from DARPA: Machine Learning Program Plan", ...

 

Google Machine Learning Patents

May 22nd, 2009

What makes machine learning such an exciting field is not only its philosophical implications towards the mind and intelligence but also its widespread adoption into much of today’s cutting edge technology. You may be surprised to discover just how much new technology is driven by machine learning. Most new digital cameras incorporate face recognition, automobile engines optimize fuel efficiency with machine learning algorithims, banks use it to monitor your credit card spending for fraud. But one of the most exciting, and lest well understood, applications  is at the core of Google’s search technology.

Google’s methods have been a well kept secret. If you where to ask Google how their search engine works they will refer you to their Pigeon Rank technology website which satirically explains how pigeons are used to rank web pages. Google is not about to make its intellectual property public knowledge. If it did Microsoft would surely have a better search engine.

 

A diagram Google offers of its Pigeon Rank technology.
A diagram Google offers of its Pigeon Rank technology.

 

But, the nature of Google search technology is not completely hidden. Patent applications as well as job posting,  research publications and professional conferences reveal just how important machine learning is to Google.

The very first United States Patent Issued to Google was a “Large scale machine learning systems and methods”, patent number 7,222,127 originally filed on December 15, 2003. The language of the patent is very intentionally vague, referring not to web sites, or to web pages, but rather only  to “nodes.“

What exactly is this invention? Well according to the patent it is:

 A system for generating a model, comprising: a plurality of nodes, at least one of the nodes being configured to: receive candidate conditions, generate statistics associated with at least a first one of the candidate conditions, send the generated statistics to at least one other one of the nodes, receive statistics regarding at least a second one of the candidate conditions from other ones of the nodes, form a rule based, at least in part, on the second candidate condition and the received statistics, add the rule to the model, and outputting the rule to at least one of the nodes.  

Looking past the obfuscation, it is a method for modeling documents such as web pages for the purpose of classifying them. It builds a model of a network by indentifying conditional relationships between pairs of nodes based on special, not very well defined, conditons. It is intentionally applied so broadly, using terms like node, that it could apply to jsut about anything.

The patent is summurized as follows:  

Systems and methods consistent with the principles of the invention may apply machine learning to large data sets, such as data sets including over one hundred thousand features and/or one million instances. The systems and methods may be capable of processing a large data set in a reasonable amount of time to generate a classification model.

Different models may be generated for use in different contexts. For example, in an exemplary e-mail context, a model may be generated to classify e-mail as either spam or normal (non-spam) e-mail. In an exemplary advertisement context, a model may be generated to estimate the probability that a user will click on a particular advertisement. In an exemplary document ranking context, a model may be generated in connection with a search to estimate the probability that a user will find a particular search result relevant. Other models may be generated in other contexts where a large number of data items exist as training data to train the model.

A more serious diagram. Figure 1 from Google Patent 7,222,127

A more serious diagram. Figure 1 from Google Patent 7,222,127

Figure 5 from Google Patent number 7,222,127

Figure 5 from Google Patent number 7,222,127

This patent is only the first one Google received. Searching the USPTO.gov website reveals dozens of additional patent applications as well as issued patents that relate to machine learning techniques. 

Allot of money has and will continue to be invested into Machine Learning technology. Learning algorithims are at the heart of todays most exciting research so this is a great time to get involved. 

So what is Machine Learning?

May 21st, 2009

In plain English, machine learning is the field where researches program computers to recognize patterns, the kind of patterns our brains recognize with ease. How do you recognize your friend’s voice over the telephone, or how can you tell the difference between a cat and a dog? These tasks are trivial for you and I, but computers struggle with even the most simple pattern recognition tasks, despite decades of research.

DogCatcatdog
You can classify these images, but can a computer?

While no one knows precisely how our brains can identify the subjects of the above photos, some smart scientists have been working on the problem of pattern classification from a mathematical perspective. They have created a standard by which a classification problem can be broken down into a system of variables that can be analyzed using statistical methods. Let’s see how this works. Let’s take the example of how to recognize if a photograph contains an image of a dog or a cat. Let us ignore for now the possibility that the photograph is of neither, or both.

The first step in a machine learning application is what is called feature extraction. An example of a simple feature could be the length or height of the subject. Also, more complex features could be the shape of the head, the posture, etc. Once the features of the subject in question have been identified and given a value, for example 18 inches tall and 25 inches in length, the machine learning algorithm will compare these values statistically with the values of known subjects. One can visualize this process on a table where the horizontal axis represents length and the vertical axis represents height. Dogs are known to be bigger than cats for example, so generally a cat will be represented by a dot at the lower left, and a dog at the upper right. In theory, by plotting the features this way, there should be two distinct regions separated by the classification boundary. The goal of any good machine learning algorithm is to find the optimal boundary. The problem with this method is clear, obviously not all dogs are taller and longer than all cats. And what about a catdog? We can recognize one without confusion, but any machine learning algorithm would struggle with such a classification.

Classification plot of length and height of sample dogs and cats.

Classification plot of length and height of sample dogs and cats.

There is clearly a lot of room in machine learning for innovation. Keep in mind also that the example given here is only a simplistic example. But, generally the problem of feature extraction is one that is at the core of machine learning.

CS 229 – Stanford Machine Learning Course

May 19th, 2009

This is an excellent online resource for an introduction to Machine Learning. In a series of 20 lectures, Professor Andrew Ng introduces his students to the rigorous mathematics of Machine Learning. His focus on the purely mathematical and whimsical mockery of graphical models will surely confound those who dislike math. But I recommend these lectures to anyone who is a serious student of machine learning because it will help to build a strong foundation of mathematical principals needed to understand the perspective of the machine learning community. Also check out the course hand outs. These are nearly as detailed and information packed as a machine learning text book, and they are free.

The course hand out website is:
http://www.stanford.edu/class/cs229/materials.html

The playlist website is:
http://www.youtube.com/view_play_list?p=A89DCFA6ADACE599

Welcome to ML Intelligence

May 19th, 2009

Welcome to ML Intelligence.
Discussion about Machine Learning, Artificial Intelligence and Statistics presented in a fully graphical style.