Web Analytics Terms

A
Abandonment: is the measure when a visitor exits or leaves a conversion process on a website and does not return later during the session.
A/B Testing: A method of banner ads, emails and landing pages testing by which a baseline control sample is compared to a variety of single-variable test samples.
Accuracy: The ability of a measurement to match the actual value of the quantity being measured. Accuracy is the foundation upon which your web analytics should be built. If you can't trust that your data is accurate, you can't make confident decisions. In statistical terms, accuracy is the width of the confidence interval for a desired confidence level.
Acknowledgement Page: A page displayed after a visitor completes an action or transaction. For example, a thank-you or a receipt page. Acknowledgement pages are often important in Scenario Analysis, where it is an indicator of a completed scenario.
Acquisition: The process of gaining customers through the means of different marketing strategies. For the purposes of web analytics, it often refers specifically to the process of attracting visitors to a web site.
ACT: After-Click Tracking is the recording the activity path of a visitor to a site after they have clicked on an email link.
Actionable Data: Information that allows you to make a decision or can be made use of in any way.
Ad: A link that takes a visitor to a web site when clicked on, usually graphic or text.
Ad Click: A click on an advertisement on a website which takes a user to another site.
Ad Hoc Query: A non-standard inquiry posed to a database of information as the need arises.
Ad View: A web page that presents an ad. There may be more than one ad on an ad view. Once visitors have viewed an ad, they can click on it.
Affiliate Marketing: A method of promoting web businesses in which an affiliate is rewarded for providing customers. Compensation could be made based on a value for visits, subscriptions, leads, sales, and so on.
Aggregate Data: A summary of collected information which groups data together without individual-level statistics.
Algorithm: A mathematical formula used by search engines to determine which web sites in their database to present in search results, in which order. While search engine algorithms change regularly, primary on-page factors include keyword density and source code optimization. The primary off-page factor is link popularity.
API: Application Programming Interface is a system that a computer or application supplies in order to allow requests for service to be made of it by other computer programs. APIs allow data to be exchanged between computer programs, and a standard software API method includes Open Database Connectivity (ODBC).
ASP: Active Server Pages are a set of software components that run on a web server and let developers build dynamic web pages.
Attrition: The erosion of your customer base over time. The opposite of customer retention.
Authentication: The technique by which access to Internet or intranet resources requires the user to enter a username and password as identification.
Average Lifetime Value: The average of the lifetime value of a visitor or multiple visitors during the reporting period, where each visitor's lifetime value is the total monetary value of a visitor's past orders since visitor tracking began.
B
Bandwidth: Measure, in kilobytes of data transferred, of the traffic on a site.
Banner Ad: An advertisement embedded on a web page usually intended to drive traffic to a different website by linking to the advertiser's site. The Interactive Advertising Bureau (IAB) has created a standard set of banner ad sizes (Medium Rectangle, Rectangle, Leaderboard, Wide Skyscraper) into a set of guidelines called the Universal Ad Package).
Benchmark: A standard by which something can be measured or judged. For example, you benchmark your Key Performance Indicators to ensure everyone in your organization is measuring performance against the same goals.
Blog/Web Logs: A self-published, managed or maintained Web diary. Usually updated daily or weekly, blogs have historically been personal, but gained notoriety after the 2004 election as an influential media outlet. Companies now use blogs to extend their brand and improve their organic search visibility.
Bounce Rate: The percentage of entrances on a web page that result in an immediate exit from the web site.
Browser: A program used to locate and view HTML documents.
Business Intelligence: While some would claim it's an oxymoron, business intelligence refers to a category of software and tools designed to gather, store, analyze, and deliver data in a user-friendly format to help organizations make more informed business decisions. Software types include dashboarding, data mining, data warehouses, and other information systems.
C
Campaign Analysis: A mesaure that tracks activity originating from a marketing campaign, so you can compare your campaigns and evaluate their effectiveness.
Client: The browser used by a website visitor.
Client Error: An error that occurs because of an invalid request by the visitor's browser.
Cloaking: In terms of search engine marketing, this is the act of getting a search engine to record content for a URL that is different than what a searcher will ultimately see. It can be done in many technical ways. Several search engines have explicit rules against unapproved cloaking. Those violating these guidelines might find their pages penalized or banned from a search engine's index. As for approved cloaking, this generally only happens with search engines offering paid inclusion program.
Click Fraud: A type of internet crime that occurs in pay per click online advertising when a person, automated script, or computer program imitates a legitimate user of a web browser clicking on an ad, for the purpose of generating a charge per click without having actual interest in the target of the ad's link.
Content Management System (CMS): a software platform that aids in the management of content on a Web site.
Contextual Link Ads/Inventory: To supplement their business models, certain text-link advertising networks (like Google) have expanded their network distribution to include "contextual inventory". Most vendors of "search engine traffic" have expanded the definition of Search Engine Marketing to include this contextual inventory. Contextual or content inventory is generated when listings are displayed on pages of Web sites (usually not search engines), where the written content on the page indicates to the ad-server that the page is a good match to specific keywords and phrases. Often this matching method is validated by measuring the number of times a viewer clicks on the displayed ad. These ads typically do not perform as well as traditional text ads on search engines, but the lower cost justifies the expense.
Conversion: An action that signifies a completion of a specified activity. For many sites, a user converts if they buy a product, sing up for a newsletter, or download a file. The conversion rate is the percentage of visitors who do convert. Cookie deletion can have an impact on your conversion rate because if a cookie is being systematically deleted, repeat visitor rates will be under-counted and new visitor rates will be over-counted, thus skewing the conversion rate metric by which you analyze your site's overall effectiveness.
Conversion Funnel: The series of steps that move a visitor towards a specified conversion event, such as an order or registration signup. See also abandonment.
Conversion Rate: The relationship between visitors to a web site and actions considered to be a "conversion," such as a sale or request to receive more information. This metric is often expressed as a percentage.
Cookie: A text file that transmits information to a data collection facility via a 1x1 pixel GIF image request and includes a tracking ID that is used to identify returning visitors. Contrary to some industry speculation, cookies can not be used for malicious use such as privacy tapping. See also first and third-party cookies.
Cost-per-Click (CPC): System where an advertiser pays an agreed amount for each click someone makes on a link leading to their web site. Also known as PPC or paid listings.
Cost-per-Thousand (CPM): System where an advertiser pays an agreed amount for the number of times their ad is seen by a consumer, regardless of the consumer's subsequent action. This term is heavily used in print, broadcasting and direct marketing, as well as with online banner ad sales. CPM stands for "cost per thousand," since ad views are often sold in blocks of 1,000. The M in CPM is Latin for thousand.
Crawler/Spider/Robot: Component of search engine that indexes web sites automatically. A search engine's crawler (also called a spider or robot), copies web page source code into its index database and follows links to other web pages.
CTR: Click Through Rate. A click through rate is the rate at which visitors "click through" from one website page or property to the next. A good indication of an ad's effectiveness.
Customer Segment: A powerful aspect of relationship marketing in which you target sub-section or group of customers who share a specific trait or set of behaviors. See also demographics and psychographics.
D
Dashboard: A web analytics dashboard provides all of your critical metrics in one place to help you understand the health or performance of your business.
Data Warehouse: is a logical collection of information gathered from many different operational databases used to create business intelligence that supports business analysis activities and decision-making tasks, primarily, a record of an enterprise's past transactional and operational information, stored in a database designed to favor efficient data analysis and reporting.
Demographics: The physical characteristics of human populations and segments of populations, often used to identify consumer markets. Demographics can include information such as age, gender, marital status, education, and geographic location. See also psychographics.
Directories: A type of search engine where listings are gathered or reviewed by humans, rather than by search engine crawlers. In directories, web sites are often reviewed, summarized in about 25 words and placed in a particular category. The largest and most popular directory site is Yahoo!
Domain: An area in the Internet specified by a URL address. The top-level domain is at the end after the dot and the second-level domain comes before it, and shows where in the top-level domain the address can be found. For example in www.xyz.com, ".com" is the top-level domain and "xyz" is the second level domain.
Domain Name: The text name that corresponds to a numeric IP address of a computer on the Internet.
Doorway/Landing/Gateway/Bridge/Jump Pages: A web page created expressly in hopes of ranking well for a term in a search engine's organic/non-paid listings and which itself does not deliver much information to those viewing it. Instead, visitors will often see only some enticement on the doorway page leading them to other pages, or they may be seamlessly redirected to a real page within the existing web site. With cloaking, visitors may never see the doorway page at all. Several search engines have guidelines against doorway pages, though they are more commonly allowed in through paid inclusion programs.
E
E-commerce: The act of selling goods and services online via a standalone site or through an online auction center.
Email Bounce: The number of e-mails that were sent but never reach the intended receiver.
Entry Page: The first viewed page on a visitor's path through a site.
Exit Page: The last page viewed on a visitor's path through a site.
F
First Party Cookie: For most business models, first-party cookies are regarded as the most reliable method to measure visitor activity. Whereas a third-party cookie is usually set by an analytics vendor, (an entity with whom the user does not have a relationship), a first-party cookie is set by the business, an organization with whom the Web site visitor has specifically chosen to do business. Because of this relationship, first-party cookies are deemed more secure by the user. Also seecookies.
Frequency: The number of times a visitor has visited a site during a reporting period. Average Frequency is the average of frequencies of all the visitors during the reporting period. Frequency is a retention metric and is part of RFM (recency, frequency, monetary) analysis.
H
Hit: Any request from a file or a web-server. A single page likely contains multiple hits as multiple image and text files are downloaded from the web-server.
Home Page: The main page of a web site. The home page provides visitors with an overview and links to the rest of the site
 I
Impression: The display of an online advertisement (usually a banner ad) to a web site visitor.
Index: The collection of information (contained in a large database) a search engine has that searchers can query against. With crawler-based search engines, the index is typically copies of all the web pages they have found from crawling the web. With human-powered directories, the index contains the summaries of all web sites that have been categorized.
Inbound/Back Link: A text or graphical hyperlink from one site to another. Google and other search engines' algorithms consider a site's popularity based on the quality and quantity of inbound links from relevant third party sites to help determine search positioning.
Internet: The Internet is the publicly accessible global system of interconnected computer networks that transmit data via a standardized Internet Protocol. See alsoWorld Wide Web.
 J
JavaScript: A scripting language developed by Netscape. While it is often used for websites, it is also used to enable scripting access to objects embedded in other applications.
K
Keyword: Terms entered into the search field of a web search engine. See also organic search and PPC.
KPI: Key Performance Indicators. Key Performance Indicators are typically kept in dashboards and provide customers with an understanding of how the site is performing.
L
Latency: The average number of days between visits for a given visitor during a reporting period. For example, those who visit on average every seven days.
Link: On a web page, text or an image that has been coded to take a browser from one page to another or from one site to another.
Log File: A file created by a web or proxy server which contains all of the access information regarding the activity on that server.
Long Tail: First coined by Chris Anderson in an October 2004 Wired magazine article to describe the niche strategy of certain business such as Amazon.com or Netflix. In relation to search engine marketing (SEM) the Long Tail refers to the keyword phrases that are highly detailed and specific and may generate low volumes of searches and traffic, but add up to generate a majority of traffic for sites with deep content or product SKUs.
LTV: Life-Time Value. Life-Time Value is a metric used to describe the value a specific customer has over the life of their relationship with you.
M
Meta Search Engine: A search engine that gets listings from two or more other search engines, rather than through its own efforts.
Meta Tags: Information placed in a web page not intended for users to see but instead which typically passes information to search engine crawlers, browser software and some other applications.
Meta Description Tag: Allows page authors to say how they would like their pages described when listed by search engines. Not all search engines use the tag.
Meta Keywords Tag: Allows page authors to add text to a page to help with the search engine ranking process.
Meta Robots Tag: Allows page authors to keep their web pages from being indexed by search engines, especially helpful for those who cannot create robots.txt files. The Robots Exclusion page provides official details.
Metrics: Metrics are a system of parameters or ways of quantitative assessment of a process that is to be measured, along with the processes to carry out such measurement. Metrics define what is to be measured.
Mobile Search: An evolving branch of information retrieval services that is centered on the convergence of mobile platforms and mobile handsets or other mobile devices. The services allow users to find mobile content interactively on mobile websites, and mobile content shows a media shift toward mobile multimedia.
Multivariate Testing: A process by which more than one component of a website may be tested in a live environment. It can be thought of in simple terms as numerous split tests or A/B tests performed on one page at the same time.
N
Navigation: The act of moving from location to location within a web site, or between web sites, accomplished by clicking on links. Navigation can also refer to the overall structure of the links on the site, comprising the paths available to the visitor.
Non-referrals: Visitors who arrive at a site by typing a domain into an address bar, using a bookmark, or clicking on an emailed URL.
O
Online Reputation Management (ORM): The act of monitoring, addressing or mitigating undesirable search engine results or mentions in online media for a company or product. Techniques include generating new content and creating posts on existing content.
OpenSearch: A collection of technologies that allow publishing of search results in a format suitable for syndication and aggregation. It is a way for websites and search engines to publish search results in a standard and accessible format. Originally developed by Amazon and recently adopted by Yahoo!, OpenSearch relies on abstract-based microformats (dataRSS, eRDF, FOAF, GeoRSS, hCard, hEvent, hReview, hAtom, MediaRSS, RDFa, XFN, etc.) to integrate syndicated content into search results.
Opt-in: This permission-based email communication requires customers to verify the opt-in method before their e-mail addresses can be used to communicate with them.
Organic/Natural Listings: Listings that search engines do not sell (unlike paid listings). Instead, sites appear solely because a search engine has deemed it editorially important for them to be included, regardless of payment. Paid inclusion content is also often considered "organic" even though it is paid for. This is because that content usually appears intermixed with unpaid organic results.
Organic Search: A type of search in which web users find sites having unpaid listings, as opposed to using the pay-per-click advertisement listings displayed among the search results.
Outbound Links: Links on a particular web page leading to other web pages on a different domain.
P
Page: A document provided by the server, including HTML, scripts, and text files. Images, sound files and video are not considered pages. Documents are defined by the system administrator, but generally include all static content, such as complete html pages. Dynamic pages are created with variables and do not exist anywhere in a static form. Forms are scripted pages which get information from a visitor that gets passed back to the server.
Page Tag: A piece of JavaScript code embedded on a web page and executed by the browser when the page is viewed. See also log files.
Page View: is generally defined as a request to load a single page of a website. On the web, a page request would result from a web surfer clicking on a link on another page that points to the page in question. See also hit.
Parameters: These are located in the URL immediately after a question mark and followed by an equal sign and a return value, known as name=value.
Path: A path is the click pattern a visitor uses as they traverse through multiple pages.
PPC: Pay Per Click or paid search uses search keywords that cost a certain amount for each customer click on that term in order to get to your site. See also organic search.
 Q
Query: A question or inquiry used to find answers about certain metrics.
Query Parameter: An individual piece of a query string consisting of a parameter name and a value for the parameter.
R
Reach: The size of the audience reading, viewing, hearing, or interacting with a message in a given period of time. Reach can be understood as either an absolute number, or a fraction of a population.
Rear-View Mirror Metrics: Metrics that measure what has occurred. For example campaign response metrics are such metrics that tell you how a campaign performed.
Recency: The number of days since a visitor's most recent visit during a reporting period.
Referrals: The location that visitors come from, particularly the sites, search engines or directories. Relationship Marketing: Relationship marketing is a type of marketing that traces its roots to direct response marketing. It emphasizes building long-term relationships with customers rather than individual transactions. It requires understanding customer needs as they go through life cycles of interacting and purchasing from organizations, and requires that marketers accurately determine customer intent in order to provide them the right message at the right time.
RFM Analysis: Recency, Frequency, Monetary analysis.
ROI: Return on Investment
Robot: An automated process that performs mundane, repeatable tasks to provide information. Search engine robots or bots provide such functions, cataloging the internet for searchers to find information.
RSS: Really Simple Syndication is a type of web syndication used by news sites and weblogs which provides summaries of information with links to the complete content.
S
Sampling: In statistics, the selection of individual observations intended to yield knowledge about a population, especially for the purposes of statistical inference.
Search Engine: A search engine is a program that helps you find information on the web.
Search Engine: Any service generally designed to allow users to search the web or a specialized database of information. Web search engines generally have paid listings and organic listings. Organic listings typically come from crawling the web, though often human-powered directory listings are also optionally offered. Top tier search engines include Google, MSN, Teoma and Yahoo!
Search Engine Marketing (SEM): The act of marketing a web site via search engines, whether this be improving rank in organic listings (search engine optimization), purchasing paid listings (PPC management) or a combination of these and other search engine-related activities (i.e. affiliate programs, shopping feeds or link development).
Search Engine Optimization (SEO): The act of altering a web site so that it does well in the organic, crawler-based listings of search engines. In the past, has also been used as a term for any type of search engine marketing activity, though now the term search engine marketing is more commonly used as an umbrella term.
Search Terms: The words (or phrase) a searcher enters into a search engine's search box. Also used to refer to the terms a search engine marketer hopes a particular page will be found for. Also called keywords, query terms or query.
Segment: A grouping of customers, defined by website activity or other data, which can be used to target them effectively.
Social Media Marketing (SMM): A form of internet marketing which seeks to achieve branding and marketing communication goals through the participation in various social media networks (MySpace, Facebook, LinkedIn), social bookmarking (Digg, Stumbleupon), social media sharing (Flickr, YouTube), review/ratings sites (ePinions, BizRate), blogs, forums, news aggregators and virtual 3D networks (SecondLife, ActiveWorlds). Each social media site can be optimized to generate awareness or traffic.
Social Media Optimization (SMO): A set of methods for generating publicity through social media, online communities and community websites. Methods of SMO include adding RSS feeds, adding a "Digg This" button, blogging and incorporating third party community functionalities like Flickr photo slides and galleries or YouTube videos. Social media optimization is a form of search engine marketing.
Spam: Any search engine marketing method that a search engine deems to be detrimental to its efforts to deliver relevant, quality search results. Some search engines have written guidelines about what they consider to be spamming, but ultimately any activity a particular search engine deems harmful may be considered spam, whether or not there are published guidelines against it. Examples of spam include the creation of nonsensical doorway pages designed to please search engine algorithms rather than human visitors, or a heavy repetition of search terms on a page to increase keyword density. .
Spider: An automated software program that gathers pages from the Internet.
Submission: The act to submitting a URL for inclusion into a search engine's index. Unless done through paid inclusion, submission generally does not guarantee listing. In addition, submission does not help with rank improvement on crawler-based search engines unless search engine optimization efforts have been implemented. Submission can be done manually (i.e., you fill out an online form and submit) or automated, where a software program or online service may process the forms behind the scenes.
Suffix: The last part of a domain that can be used to identify the type of organization or location of a site.
T
Third-party cookie: Hosted web analytics services track visitor behavior by inserting a small piece of tracking code onto each page of a site. Because the cookie is served by an analytics vendor rather than your own site, the cookie is considered third-party.
Traffic: On the web, traffic refers to the amount of data sent and received by visitors to a website.
U
URL: A Uniform Resource Locator is a means of identifying an exact location on the Internet.
User Agent: Fields in an extended web server log file identifying the browser and platform used by a visitor.
User Session: A period of activity (all hits) for one user of a website. A unique user is determined by the IP address or cookie. Typically, a user session is terminated when a user is inactive for more than 30 minutes.
Unique Visitors: refers to a measure captured by web analytics solutions that track the interaction a single user has with a website over time.
V
Viral Marketing: Any marketing technique that induces Web sites or users to pass on a marketing message to other sites or users, creating a potentially exponential growth in the message's visibility and effect.
Visitor: Similar to unique visitor, visitor refers to an individual that visits a website. A visitor or unique visitor can have multiple visits.
Visitor Session: Interaction by a site visitor. The session ends when the visitor leaves the site.
Visit: A visit is an interaction a unique visitor has with a website over a specified period of time or activity. If a visitor has left a site or has not executed a click within 30 minutes, the visit session will terminate.
W
W3C: World Wide Web Consortium develops interoperable technologies (specifications, guidelines, software, and tools) to lead the Web to its full potential.
Web 2.0: The use of World Wide Web technology and web design that aims to facilitate creativity, information sharing, and, most notably, collaboration among users. These concepts have led to the development and evolution of web-based communities and hosted services, such as social-networking sites, wikis, blogs, and folksonomies.
Web Analytics: The measurement of data as it relates to an Internet site, including the behavior of visitors, the amount of traffic, the conversion rates, web server performance, user experience, and other information in order to understand and proof of results and continually improve the results of a site towards a set of objectives.
Website: A website is a collection of web pages, on particular domain name or sub-domain on the World Wide Web on the Internet. Usually it is made up of a set of web pages created using HTML and accessible via HTTP.
What if: A type of analysis that allows an end-user to pose hypothetical situations against their data to model or predict outcomes.
World Wide Web: Also called the web, this is a global information space which people can communicate via computers connected to the Internet. Some people use "internet" and "the web" interchangeably, even though the web is a service that operates over the internet.
X
XML: Extensible Markup Language is a World Wide Web Consortium (W3C) recommended general-purpose markup language for creating special-purpose markup languages, capable of describing many different kinds of data.
XML Feeds: A form of paid inclusion where a search engine is "fed" information about pages via XML, rather than gathering that information through crawling actual pages. Marketers can pay to have their pages included in a spider based search index either annually per URL or on a CPC basis based on an XML document representing each page on the client site. New media types are being introduced into paid inclusion, including graphics, video, audio, and rich media. These feeds are commonly used for Shopping Feeds.
Z
Zero Latency: Latency is a time delay between the moment something is started, and the moment one of the effects of that event begins. When there is no time lapse between the event and the effect, it's called zero latency. In analytics, this term is used to describe instantaneous receipt of data and the ability to analyze and act on that data.
Zero-page Visit: A visit that included no page views. This is possible if a visit consisted of at least one request for a non-page file (such as a graphic) but no page files (such as .htm, .asp, .jsp, or .cfm.)