path: root/chrome/tools/webforms_extractor.py
Commit message | Author | Age | Files | Lines
* Fix python scripts in src/chrome/
  Author: maruel@chromium.org | Age: 2011-11-27 | Files: 1 | Lines: -253/+254

  Make sure that:
  - shebang is only present for executable files
  - shebang is #!/usr/bin/env python
  - __main__ is only present for executable files
  - file's executable bit is coherent

  Also fix EOF LF to be only one. Minor python style fixes.

  TBR=nirnimesh@chromium.org
  BUG=105108
  TEST=
  Review URL: http://codereview.chromium.org/8680018
  git-svn-id: svn://svn.chromium.org/chrome/trunk/src@111658 0039d316-1c4b-4281-b951-d872f2087c98
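  The shebang and executable-bit rules listed in this commit can be checked mechanically. The following is a minimal sketch, not the tooling used for the commit: `check_script` and `EXPECTED_SHEBANG` are hypothetical names, and the check assumes a POSIX file system where the owner-executable bit is meaningful.

```python
import os
import stat

# The shebang form named in the commit message.
EXPECTED_SHEBANG = "#!/usr/bin/env python"


def check_script(path):
    """Return a list of problems found in the file at `path` (hypothetical helper)."""
    with open(path, "rb") as f:
        first_line = f.readline().decode("utf-8", "replace").rstrip("\r\n")

    has_shebang = first_line.startswith("#!")
    # Only the owner-executable bit is inspected in this sketch.
    is_executable = bool(os.stat(path).st_mode & stat.S_IXUSR)

    problems = []
    if has_shebang and first_line != EXPECTED_SHEBANG:
        problems.append("unexpected shebang: %s" % first_line)
    if has_shebang != is_executable:
        problems.append("shebang and executable bit disagree")
    return problems
```

  An empty list means the file's shebang and executable bit are coherent under these assumptions.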
* Improve the webforms extractor script.
  Author: dyu@chromium.org | Age: 2011-07-20 | Files: 1 | Lines: -0/+253

  Remove duplicate code by adding command line options to remove either JS elements or all non-form elements.
  Add a regex pattern to extract the form location comment so the extracted form has the location of the originating form.

  TEST=none
  BUG=none
  Review URL: http://codereview.chromium.org/7381005
  git-svn-id: svn://svn.chromium.org/chrome/trunk/src@93247 0039d316-1c4b-4281-b951-d872f2087c98
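  The two modes this commit describes (strip JS elements, or keep only form elements) could be combined in one function along these lines. This is a sketch only: `clean_html`, `forms_only`, and both patterns are assumptions made for illustration and are not taken from webforms_extractor.py, whose actual option names and regexes are not shown in this log.

```python
import re

# Hypothetical patterns; regex-based HTML matching is a simplification.
SCRIPT_RE = re.compile(r"<script\b.*?</script\s*>", re.DOTALL | re.IGNORECASE)
FORM_RE = re.compile(r"<form\b.*?</form\s*>", re.DOTALL | re.IGNORECASE)


def clean_html(html, forms_only=False):
    """Strip <script> elements, or keep only <form> elements when forms_only is set."""
    if forms_only:
        # Drop everything that is not inside a form element.
        return "\n".join(FORM_RE.findall(html))
    # Otherwise remove script elements and leave the rest of the page intact.
    return SCRIPT_RE.sub("", html)
```

  Sharing one function between the two modes is one way to avoid the duplicated code the commit mentions.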
* Improved the webforms extractor script.
  Author: dyu@chromium.org | Age: 2011-07-08 | Files: 1 | Lines: -205/+0

  Renamed the original extractor script by appending '_js' to indicate that the script is used to strip all JS elements.
  The new extractor script extracts all forms from an HTML file and saves each form to its own file. If a downloaded site contains multiple forms, one file is created per form.

  TEST=none
  BUG=none
  Review URL: http://codereview.chromium.org/7248046
  git-svn-id: svn://svn.chromium.org/chrome/trunk/src@91892 0039d316-1c4b-4281-b951-d872f2087c98
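  The one-file-per-form behavior described in this commit might look roughly like the sketch below. The function name `extract_forms`, the `_formN.html` naming scheme, and the pattern are all assumptions for illustration; the log does not show how the real script names its output files. The sketch targets Python 3, whereas the 2011 script would have been Python 2.

```python
import os
import re

# Hypothetical pattern: a whole <form>...</form> element, matched non-greedily.
FORM_RE = re.compile(r"<form\b.*?</form\s*>", re.DOTALL | re.IGNORECASE)


def extract_forms(html_path, out_dir):
    """Write each form found in html_path to its own file; return the paths written."""
    with open(html_path, encoding="utf-8", errors="replace") as f:
        html = f.read()

    base = os.path.splitext(os.path.basename(html_path))[0]
    os.makedirs(out_dir, exist_ok=True)

    written = []
    for index, form in enumerate(FORM_RE.findall(html), start=1):
        # Assumed naming scheme: <source name>_form<N>.html
        out_path = os.path.join(out_dir, "%s_form%d.html" % (base, index))
        with open(out_path, "w", encoding="utf-8") as out:
            out.write(form)
        written.append(out_path)
    return written
```

  A page with no forms produces no output files and an empty list.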
* Minor improvements in regex and added Verbose mode for expressing the regex.
  Author: dyu@chromium.org | Age: 2011-06-07 | Files: 1 | Lines: -15/+50

  BUG=none
  TEST=none
  Review URL: http://codereview.chromium.org/7044036
  git-svn-id: svn://svn.chromium.org/chrome/trunk/src@88217 0039d316-1c4b-4281-b951-d872f2087c98
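  Assuming "Verbose mode" here refers to Python's `re.VERBOSE` flag, the idea is that whitespace inside the pattern is ignored and `#` starts a comment, so each part of the regex can be documented inline. The pattern below is a hypothetical example and not the script's actual regex, which this log does not show.

```python
import re

# Hypothetical pattern written in verbose mode for readability.
FORM_RE = re.compile(
    r"""
    <form\b[^>]*>   # opening <form ...> tag with any attributes
    .*?             # form body, matched non-greedily
    </form\s*>      # closing tag, allowing whitespace before '>'
    """,
    re.VERBOSE | re.DOTALL | re.IGNORECASE,
)


def find_forms(html):
    """Return every <form>...</form> element found in html as a string."""
    return FORM_RE.findall(html)
```

  `re.DOTALL` lets `.` span line breaks, which matters because forms usually cover several lines of markup.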
* Script that extracts forms from HTML files. Used in conjunction with the aggregator script that collects web pages with fillable forms.
  Author: dyu@chromium.org | Age: 2011-06-03 | Files: 1 | Lines: -0/+170

  webforms_extractor.py

  TEST=none
  BUG=none
  Review URL: http://codereview.chromium.org/6735043
  git-svn-id: svn://svn.chromium.org/chrome/trunk/src@87887 0039d316-1c4b-4281-b951-d872f2087c98