Skip to main content

Posts

Showing posts with the label lecture notes Web technology

code to create a form using its elements

<html> <head> <title> Addmission</title> </head> <body> <center><h1> ADDIMISSION FORM </h1></center> <form name="regestration" > Student name:   <input type=text name=txt1><br> Father's name: <input type=text name=txt2><br> D.O.B: <input type=text name=txt3><br> Age(acc. to D.O.B): <input type=text name=txt4><br> Address: <input type=text name=txt5><br> Phone no.: <input type=text name=txt6><br> <h3> Student's gender:</h3> Male:<input type=checkbox name=ckh1><br> female:<input type=checkbox name=ckh2><br> <h3>select the class:</h3> Xth:<input type=radio name=r1><br> XIth:<input type=radio name=r2><br> XIIth:<input type=radio name=r3><br> <h3>more about student:</h3> <inp

HTML Tags

HTML COMMENTS HTML comments start with <! -- And end with -->. Each and every thing written within these characters will be ignored by the browsers. These are used to explain the purpose the HTML tags used in the documents. It can be used anywhere in the document. Following is the general syntax:- <! -- comment text --> TEXT FORMATTING PARAGRAPH HTML provides a <p> tag, which is used to start a new paragraph, escaping one line between new line and previous line. Following is the general syntax:- <p> this is a new paragraph </p> HEADING A web page contain different heading with different sizes, color and fonts. HTML provides different tags for such headings. There are 6 levels of headings h1 to h6, where h1 is the largest in size and h6 is the smallest. <h1> heading 1 </h1> <h2> heading 2 </h2> <h3> heading 3 </h3> <h4> heading 4 </h4> <h5> heading 5 </h5> <h6

Web Search Technology [Lecture notes Information retrieval]

Web search engines work by storing information about many web pages, which they retrieve from the Web itself. Crawler-based search engines have three major elements. First is the spider, also called the crawler. The spider visits a web page, reads it, and then follows links to other pages within the site [1]. This is what it means when someone refers to a site being "spidered" or "crawled". The spider returns to the site on a regular basis, such as every month or two, to look for changes. Everything the spider finds goes into the second part of the search engine, the index. The index, sometimes called the catalogue, is like a giant book containing a copy of every web page that the spider finds. If a web page changes, then this book is updated with the new information. Sometimes it can take a while for new pages or changes that the spider finds to be added to the index. Thus, a web page may have been "spidered" but not yet "indexed". Until

Modern Web IR

                                 Evolution of Modern WebIR In 1995, everything changed with the creation of the web. Web objects are the largest collection of information ever created by humans, and this collection changes continuously when new objects are created and old ones removed. In order to adapt to this changed scenario, a new discipline has been created: Web Information Retrieval [1,2,3]. It uses some concepts of traditional IR, and introduces many innovative ones. Modern WebIR [4] is a discipline which has exploited some of the classical results of information retrieval, thereby developing innovative models of information access. A recent report showed that 80% of Web surfers discover new sites (that they visit) through search engines [4,5] (such as Ask, Google, MSN or Yahoo). 1.3.1 Types of Modern WebIR Information retrieval on the Web can be broadly classified into two technologies: 1. Question Answering Systems (QA): In information retrieval , question ans

Classic Model of Web IR

The classic model for IR An IR system typically consists of three main subsystems: document representation, representation of user’s requirements (queries), and the algorithms used to match user requirements (queries) with document representations. A document collection consists of many documents containing information about various subjects or topics of interests [1]. Document contents are transformed into a document representation (either manually or automatically) which is done in a way such that matching these with queries is easy and these representations should correctly reflect the author's intention [2]. The primary concern in representation is how to select proper index terms. Typically, representation proceeds by extracting keywords that are considered as content identifiers and organizing them into a given format. Queries transform the user's information need into a form that correctly represents the user's underlying information requirement and is suitable

A Brief History of Information Retrieval

                                     The Classic Model for Information Retrieval   Information retrieval (IR) can be understood as the task of finding material (usually documents) of an unstructured nature (usually text), which satisfies information need from within large collections (usually stored on computers). Formal retrieval models have formed the basis of IR research. Since early 1960s, a number of different models have been developed to describe aspects of the retrieval task: document content and structure, inter-document linkage, queries, users, their information needs and the context in which the retrieval task is embedded. The reliability on formal retrieval models is one of the great strengths of IR research [1,2,3, 4]. While using an IR system, a user, driven by an information need, constructs a query in some query language. The query is then submitted to a system that selects from a collection of documents (corpus), those documents which match the query as ind