<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">

<head>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8" />
<meta name="description" content="Block template has been developed to be used in RAMA CMS" />
<meta name="keywords" content="xhtml, css, html, printing, template" />
<link rel="stylesheet" type="text/css" href="style.css" media="all" />
<title>Arabic Auto Summarize Class</title>
</head>

<body>

<center>
<div style="width: 80%; position:absolute; left:10%; top:0%; z-index:1">
<br />
<div class="tabArea" align="center">
  <a class="tab" href="example.php">Example</a>
  <a class="tab" href="about.html">About</a>
</div>

<div class="Paragraph">

<h2>Arabic Auto Summarize Class</h2><br />

<div class="img-shadow">
<img src="images/thumb.gif" alt="Khaled Al-Sham'aa" border="0" width="260" height="277" />
</div>

<span class="FirstChar">T</span>his class identifies the key points in an Arabic document
for you to share with others or quickly scan. The class determines key points 
by analyzing an Arabic document and assigning a score to each sentence. Sentences
that contain words used frequently in the document are given a higher score. You
can then choose a percentage of the highest-scoring sentences to display in the
summary. The "ArAutoSummarize" class works best on well-structured documents
such as reports, articles, and scientific papers.
<p>The "ArAutoSummarize" class cuts wordy copy to the bone by counting 
  words and ranking sentences. First, it identifies 
  the most common words in the document (barring common stop words such as &quot;هو&quot;, &quot;هي&quot;, 
  &quot;في&quot;, &quot;حتى&quot;, &quot;من&quot; and the like) and assigns
  a &quot;score&quot; to each word: the more frequently a word is used, the higher 
  its score.</p>
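<p>The word-scoring step can be sketched roughly as follows. This is an illustration in Python, not the class's own PHP code: the function name <code>word_scores</code> is hypothetical, the score is assumed to be the raw occurrence count, and the stop-word set holds only the five words quoted above.</p>

```python
import re
from collections import Counter

# Only the stop words quoted in the text; a real list would be much longer.
STOP_WORDS = {"هو", "هي", "في", "حتى", "من"}


def word_scores(text, stop_words=STOP_WORDS):
    """Return a Counter mapping each non-stop word to its occurrence count.

    Assumption: a word's "score" is simply how often it appears.
    """
    # \w+ matches runs of Unicode letters and digits, including Arabic letters.
    words = re.findall(r"\w+", text)
    return Counter(w for w in words if w not in stop_words)


scores = word_scores("الكتاب في البيت من الكتاب")
print(scores.most_common(1))
```

<p>Note that this naive tokenizer does not handle Arabic diacritics or word stems; a production summarizer would likely normalize the text first.</p>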
<p>Then, it &quot;averages&quot; each sentence by adding the scores of its words 
  and dividing the sum by the number of words in the sentence: the higher the 
  average, the higher the rank of the sentence. The "ArAutoSummarize"
  class can summarize a text to a specific number of sentences or to a percentage of the 
  original copy.</p>
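<p>A minimal sketch of the averaging and selection steps described above, again in Python rather than the class's PHP. The function name <code>summarize</code> and its <code>ratio</code> and <code>count</code> parameters are assumptions made for illustration, as are the naive sentence splitter and the default ratio of 25%.</p>

```python
import math
import re
from collections import Counter

STOP_WORDS = {"هو", "هي", "في", "حتى", "من"}  # sample list only


def summarize(text, ratio=None, count=None, stop_words=STOP_WORDS):
    """Return the highest-scoring sentences of text, in their original order.

    Pass count for a fixed number of sentences, or ratio (0..1) for a
    fraction of the original. Hypothetical interface, not the PHP class's API.
    """
    # Naive split on Latin and Arabic sentence-ending punctuation.
    sentences = [s.strip() for s in re.split(r"[.!?؟]+", text) if s.strip()]
    tokens = [re.findall(r"\w+", s) for s in sentences]

    # Word score = frequency over the whole document, stop words excluded.
    freq = Counter(w for words in tokens for w in words if w not in stop_words)

    def average(words):
        # Sum of word scores divided by the number of words in the sentence.
        # Stop words contribute 0 to the sum but still count as words.
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    if count is None:
        fraction = 0.25 if ratio is None else ratio
        count = max(1, math.ceil(len(sentences) * fraction))

    # Rank sentence indices by average score, keep the top ones,
    # then restore document order so the summary reads naturally.
    ranked = sorted(range(len(sentences)),
                    key=lambda i: average(tokens[i]), reverse=True)
    return [sentences[i] for i in sorted(ranked[:count])]


print(summarize("cat cat cat. bird. cat fish dog.", count=2))
```

<p>Because the score is a plain average, a very short sentence made of one frequent word can outrank a longer, more informative one; that is a property of this sketch and may not match how the class behaves.</p>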
<p>We use a statistical approach, with some attention paid to:</p>
<ul>
  <li><span class="subTitle">Location:</span> the leading sentences of a paragraph or document, the title, the introduction, and
  the conclusion.</li>
  <li><span class="subTitle">Fixed phrases:</span> "خصوصا" (especially), "نتيجة" (result), "خلاصة" (summary), "تحقيقات" (investigations), "هام" (important), in-text
  summaries, etc.</li>
  <li><span class="subTitle">Frequencies</span> of words, phrases, and proper names.</li>
  <li><span class="subTitle">Contextual material:</span> the query, title, headline, and initial paragraph.</li>
</ul>
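<p>One way such cues could be combined with the frequency score is to multiply a sentence's base score by a small bonus for each cue that applies. The sketch below is purely illustrative: the function <code>boosted_score</code>, its parameters, and all bonus weights are hypothetical and are not taken from the class.</p>

```python
# Cue phrases quoted in the list above.
CUE_PHRASES = ("خصوصا", "نتيجة", "خلاصة", "تحقيقات", "هام")


def boosted_score(base_score, sentence, index_in_paragraph, title_words=(),
                  lead_bonus=1.5, cue_bonus=1.25, title_bonus=1.1):
    """Scale a frequency-based score using location, cue-phrase and title hints.

    All weights are made-up defaults for illustration only.
    """
    score = base_score
    # Location: favor the leading sentence of a paragraph.
    if index_in_paragraph == 0:
        score *= lead_bonus
    # Fixed phrases: simple substring match, so a cue can also match
    # inside a longer word.
    if any(phrase in sentence for phrase in CUE_PHRASES):
        score *= cue_bonus
    # Contextual material: favor sentences sharing a word with the title.
    sentence_words = sentence.split()
    if any(word in sentence_words for word in title_words):
        score *= title_bonus
    return score


print(boosted_score(2.0, "نتيجة الدراسة واضحة", 0))
```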
<p>The motivation for this class is the range of applications for key phrases:</p>
<span class="Quote">
The point of the list is that there are many uses for key phrases,
  so a class for automatically generating good key phrases should have a sizable
  market.
</span>
<ul>
<li><span class="subTitle">Mini-summary:</span> Automatic key phrase extraction can provide a quick mini-summary
  of a long document. For example, it could be a feature of a web site: just
  click the summarize button when browsing a long web page.</li><br />
<li><span class="subTitle">Highlights:</span> It can highlight key phrases in a long document, to facilitate
  skimming the document.</li><br />
<li><span class="subTitle">Author Assistance:</span> Automatic key phrase extraction can help an author or editor
  who wants to supply a list of key phrases for a document. For example, the administrator
  of a web site might want to have a key phrase list at the top of each web page.
  The automatically extracted phrases can be a starting point for further manual
  refinement by the author or editor.</li><br />
<li><span class="subTitle">Text Compression:</span> On a device with limited display capacity or limited bandwidth,
  key phrases can be a substitute for the full text. For example, an email message
  could be reduced to a set of key phrases for display on a pager; a web page
  could be reduced for display on a portable wireless web browser.</li><br />
</ul>
<p>This list is not intended to be exhaustive, and there may be some overlap in 
  the items.
</p>

</div>
<br />
</div>
</center>
</body>
</html>