pero hace varias semanas
y aun no encuentro la solución
la idea es obtener la url de los archivos desde el código html
lo primero es obtener e href y luego concatenar con la url del html
pero e probado y buscado y no puedo solucionarlo
la manera de obtener el valor de href desde la consola
usando programas awk, grep y sed
wget http://security.debian.org/dists/stable/updates/main/binary-i386
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<html>
<head>
<title>Index of /dists/stable/updates/main/binary-i386</title>
</head>
<body>
<h1>Index of /dists/stable/updates/main/binary-i386</h1>
<table><tr><th><img src="/icons/blank.gif" alt="[ICO]"></th><th><a href="?C=N;O=D">Name</a></th><th><a href="?C=M;O=A">Last modified</a></th><th><a href="?C=S;O=A">Size</a></th></tr><tr><th colspan="4"><hr></th></tr>
<tr><td valign="top"><img src="/icons/back.gif" alt="[DIR]"></td><td><a href="/dists/stable/updates/main/">Parent Directory</a></td><td> </td><td align="right"> - </td></tr>
<tr><td valign="top"><img src="/icons/unknown.gif" alt="[ ]"></td><td><a href="Packages.bz2">Packages.bz2</a></td><td align="right">16-Jul-2011 09:14 </td><td align="right">133K</td></tr>
<tr><td valign="top"><img src="/icons/compressed.gif" alt="[ ]"></td><td><a href="Packages.gz">Packages.gz</a></td><td align="right">16-Jul-2011 09:14 </td><td align="right">169K</td></tr>
<tr><td valign="top"><img src="/icons/unknown.gif" alt="[ ]"></td><td><a href="Release">Release</a></td><td align="right">17-Jul-2011 12:12 </td><td align="right">110 </td></tr>
<tr><th colspan="4"><hr></th></tr>
</table>
<address>Apache Server at security.debian.org Port 80</address>
</body></html>
<html>
<head>
<title>Index of /dists/stable/updates/main/binary-i386</title>
</head>
<body>
<h1>Index of /dists/stable/updates/main/binary-i386</h1>
<table><tr><th><img src="/icons/blank.gif" alt="[ICO]"></th><th><a href="?C=N;O=D">Name</a></th><th><a href="?C=M;O=A">Last modified</a></th><th><a href="?C=S;O=A">Size</a></th></tr><tr><th colspan="4"><hr></th></tr>
<tr><td valign="top"><img src="/icons/back.gif" alt="[DIR]"></td><td><a href="/dists/stable/updates/main/">Parent Directory</a></td><td> </td><td align="right"> - </td></tr>
<tr><td valign="top"><img src="/icons/unknown.gif" alt="[ ]"></td><td><a href="Packages.bz2">Packages.bz2</a></td><td align="right">16-Jul-2011 09:14 </td><td align="right">133K</td></tr>
<tr><td valign="top"><img src="/icons/compressed.gif" alt="[ ]"></td><td><a href="Packages.gz">Packages.gz</a></td><td align="right">16-Jul-2011 09:14 </td><td align="right">169K</td></tr>
<tr><td valign="top"><img src="/icons/unknown.gif" alt="[ ]"></td><td><a href="Release">Release</a></td><td align="right">17-Jul-2011 12:12 </td><td align="right">110 </td></tr>
<tr><th colspan="4"><hr></th></tr>
</table>
<address>Apache Server at security.debian.org Port 80</address>
</body></html>