[RV100 QY] intrepid alpha 6: no signal during install on AGP system

Bug #272991 reported by Jeff Trull
2
Affects Status Importance Assigned to Milestone
xserver-xorg-driver-ati
Won't Fix
Medium
xserver-xorg-video-ati (Ubuntu)
Invalid
High
Unassigned

Bug Description

On this test version of Kubuntu Intrepid, my monitor goes black after about 30 seconds. Normally I would expect the usual keyboard selection etc. to appear, but the last video I get is the boot menu itself. The monitor indicates no signal.

This happens whether I use the "install" or "live CD" options, and regardless of whether I choose the "Safe video" mode with F4 (this is usually necessary for me due to my weird monitor). So it's possible that safe video mode is broken - I'm not sure.

My graphics card is:
01:00.0 VGA compatible controller: ATI Technologies Inc Radeon RV100 QY [Radeon 7000/VE] (prog-if 00 [VGA controller])

The attached monitor is a Samsung SyncMaster 170MP, connected via VGA, at 1280x1024/58Hz and the vesa driver. Historically autodetection by the installer fails so I boot in "safe video mode" and supply my own xorg.conf later. This has worked for me through Hardy.

My motherboard is an Asus K8V-X SE with an Athlon 64 2800+ installed.

Revision history for this message
stevenschlansker (stevenschlansker) wrote :

It might be useful to indicate what hardware (esp. motherboard and video card and monitor) you are using, as well as things like the monitor refresh rate and how you are plugged in (DVI/VGA)

Jeff Trull (jetrull)
description: updated
Jeff Trull (jetrull)
description: updated
Revision history for this message
Wouter Stomp (wouterstomp-deactivatedaccount) wrote :

What changes did you have to make in hardy in xorg.conf to get it to work? Also is it possible to run lspci from the command line on that machine (e.g. using hardy) and attaching the output here? Thanks.

Revision history for this message
Jeff Trull (jetrull) wrote :

I changed my xorg.conf some time ago because I found that my particular monitor doesn't work well with DDC and certain acceleration options make my graphics card unhappy. It may have been as far back as Dapper but definitely in Feisty it was like this. My installation procedure for new releases through Hardy has been to install in "safe video mode", then reboot in safe video mode (or text mode), copy over my old xorg.conf, and reboot in normal mode.

The problem as I see it is that safe video mode install no longer works - having to do this extra work with my xorg.conf does not bother me, as long as I can install.

I have attached my

Revision history for this message
Jeff Trull (jetrull) wrote :

Last line above should read "I have attached my lspci output". Here is xorg.conf for your review. I don't think it's very interesting - the important part is replacing the driver with vesa. I wish I could remember why I needed to do this - a radeon driver bug was involved, I think. Maybe it's resolved now.

Revision history for this message
Bryce Harrington (bryce) wrote :

Can you attach your /var/log/Xorg.0.log and /var/log/Xorg.0.log.old from after attempting to boot with the -ati driver specified in xorg.conf? Also, please attach `lspci -vvnn`; the default lspci output doesn't include al lthe pci id numbers we need.

Another thing to try is disabling compiz; sometimes X boots up properly without it.

Changed in xserver-xorg-video-ati:
status: New → Incomplete
Revision history for this message
Jeff Trull (jetrull) wrote :

Thanks, Bryce. I will try to get this information for you on my current Hardy environment. Just to be certain, though, I should clarify that this is a problem with Intrepid install and I have no way of getting log files from that (to my knowledge). Having to use the vesa driver under Hardy isn't bothering me...

Revision history for this message
Jeff Trull (jetrull) wrote :

This Xorg log was produced when I specified "ati" as the driver instead of "vesa" in xorg.conf. The symptom (monitor power down with blinking LED) is identical to when I attempt to install Intrepid in safe video mode.

Revision history for this message
Jeff Trull (jetrull) wrote :

This Xorg log is the one produced when I use the "vesa" driver, i.e., when I use the xorg.conf I attached previously. In this one everything is fine.

Revision history for this message
Jeff Trull (jetrull) wrote :

The output of "lspci -vvnn" (as root)

Revision history for this message
Jeff Trull (jetrull) wrote :

Based on other bug reports I went ahead and did some experiments. I can tell you that:

1) disabling dri and glx allows me to see video, although the quality is poor
2) AGPMode appears to have no effect
3) "ignore LVDS" doesn't help either

Revision history for this message
Bryce Harrington (bryce) wrote :

Thanks for testing all those things.

One other thing to test, is to leave dri and glx enabled, but disable compiz (and easy way is to 'chmod -x /usr/bin/compiz') and see how that looks.

Changed in xserver-xorg-video-ati:
importance: Undecided → High
status: Incomplete → Confirmed
Revision history for this message
Jeff Trull (jetrull) wrote :

In fact compiz is not even installed on my machine... I'm a KDE 3.5.9 user (KDE4 video quality was terrible, the subject of a different bug). Hopefully this will change with Intrepid.

What driver is used during "safe video mode" install? I'm really not that concerned about getting radeon to work at this time, but not being able to install Intrepid is a bummer. Thanks.

Revision history for this message
Bryce Harrington (bryce) wrote :

The failsafe mode uses the vesa driver.

Revision history for this message
Jeff Trull (jetrull) wrote :

In that case this isn't really an ati bug, is it? Although I'd be delighted for that to finally work.

I filed this bug as "kubuntu-meta" thinking perhaps "safe video" (failsafe?) was not choosing the vesa driver in Intrepid.

Revision history for this message
Bryce Harrington (bryce) wrote :

Perhaps this is a case of bug 261977

Revision history for this message
Jeff Trull (jetrull) wrote :

I don't believe so. It seems that 261977 is a case of a specific nVidia driver being chosen for a device it does not support. However, there is a bit in there about the nv driver claiming a device, and preventing vesa from subsequently doing so itself... that could be related if the analogy to ati holds.

In other news I have the radeon driver *almost* working - by removing the Modes information and disabling only "glx", I get a good image at the proper resolution, but with a bad flicker event about every 10 seconds. xorg.conf attached if you have any suggestions - successful vesa config is exactly that with driver changed to "vesa" and no "AccelMode EXA". I am up and running with Intrepid Beta now via the alternate install CD (text mode).

Revision history for this message
Frédéric Kerneuzet (contact-kerneuzet) wrote :

I got exactly the same problem (I think) with my ATI Radeon 3850 (not fixed under final version), but my bug has been tagged as "duplicate" of the "nv bug"...

Revision history for this message
Bryce Harrington (bryce) wrote :

261977 was actually a problem in the xserver's logic for selecting the driver for a card. Basically, there was a routine that looked up the card's pci id to see what driver to use, and if it didn't find one - meaning the driver didn't support the card, it left it blank. Later on, there was some other (older) logic that kicked in if no driver had been selected, that picked the driver based on the vendor name, -nv for nVidia hardware, -ati for ATI hardware. It tends to afflict -nv more since there's a lot more hardware -nv doesn't support, but in theory it could also affect -ati.

Anyway... comment #14 sort of made me think of that bug. But since that problem is fixed, this is obviously something completely different.

Great to hear that you were able to get it to mostly work with EXA and disabling glx. I'm curious if EXA is really needed, or if it also works the same with XAA (the default)? We are considering switching to EXA as the default for Jaunty (see https://blueprints.edge.launchpad.net/ubuntu/+spec/radeon-change-xaa-to-exa), so I'm curious both if it fixes this issue, and if the flickering could be due to EXA.

Revision history for this message
Bryce Harrington (bryce) wrote :

Aha, I should have noticed this sooner - you've got an AGP system, and your symptoms almost exactly match a known issue with -ati on AGP. Please see https://wiki.ubuntu.com/X/Quirks#ATI AGP Mode Quirk

Please try this. In your xorg.conf, set the following:

    Option "AGPMode" "1"

Also try values 2, 4, 8. Sometimes your BIOS will include AGP settings - make sure it is set to the factory default for this testing (since we want to find a combo that works "out of the box").

Changed in xserver-xorg-video-ati:
status: Confirmed → Incomplete
Revision history for this message
Jeff Trull (jetrull) wrote : Re: intrepid alpha 6: no video during install on AGP system

Thanks for the suggestions, but it looks like specifying EXA vs. not specifying it makes no difference to my observations, nor do the AGPMode settings.

As for bug 261977, was it fixed after alpha 6? I haven't retried the "safe video" install since then. My current Intrepid build is off the alternate Beta CD plus online updates.

Revision history for this message
In , Bryce Harrington (bryce) wrote :

Created an attachment (id=20770)
xorg.conf

Forwarding this bug from a Ubuntu reporter:
https://bugs.edge.launchpad.net/ubuntu/+source/xserver-xorg-video-ati/+bug/272991

[Problem]
Monitor goes black (no signal) after 30 seconds, unless the glx module is disabled. With glx disabled, display works with proper resolution but has a bad flicker every 10 sec.

Fiddling with AGPMode, EXA, DRI, and other parameters didn't have an effect.

[lspci]
01:00.0 VGA compatible controller [0300]: ATI Technologies Inc Radeon RV100 QY [Radeon 7000/VE] [1002:5159] (prog-if 00 [VGA controller])
 Subsystem: ATI Technologies Inc Unknown device [1002:053a]

Monitor: Samsung SyncMaster 170MP (VGA)
Motherboard: Asus K8V-X SE
Processor: Athlon 64 2800+

[Original Report]
On this test version of Kubuntu Intrepid, my monitor goes black after about 30 seconds. Normally I would expect the usual keyboard selection etc. to appear, but the last video I get is the boot menu itself. The monitor indicates no signal.

This happens whether I use the "install" or "live CD" options, and regardless of whether I choose the "Safe video" mode with F4 (this is usually necessary for me due to my weird monitor). So it's possible that safe video mode is broken - I'm not sure.

The attached monitor is a Samsung SyncMaster 170MP, connected via VGA, at 1280x1024/58Hz and the vesa driver. Historically autodetection by the installer fails so I boot in "safe video mode" and supply my own xorg.conf later. This has worked for me through Hardy.

I changed my xorg.conf some time ago because I found that my particular monitor doesn't work well with DDC and certain acceleration options make my graphics card unhappy. It may have been as far back as Dapper but definitely in Feisty it was like this. My installation procedure for new releases through Hardy has been to install in "safe video mode", then reboot in safe video mode (or text mode), copy over my old xorg.conf, and reboot in normal mode.

Based on other bug reports I went ahead and did some experiments. I can tell you that:

1) disabling dri and glx allows me to see video, although the quality is poor
2) AGPMode appears to have no effect
3) "ignore LVDS" doesn't help either

I have the radeon driver *almost* working - by removing the Modes information and disabling only "glx", I get a good image at the proper resolution, but with a bad flicker event about every 10 seconds. xorg.conf attached if you have any suggestions - successful vesa config is exactly that with driver changed to "vesa" and no "AccelMode EXA". I am up and running with Intrepid Beta now via the alternate install CD (text mode).

Revision history for this message
In , Bryce Harrington (bryce) wrote :

Created an attachment (id=20771)
Xorg.0.log

Bryce Harrington (bryce)
Changed in xserver-xorg-video-ati:
status: Incomplete → Confirmed
Revision history for this message
Bryce Harrington (bryce) wrote : Re: intrepid alpha 6: no signal during install on AGP system

Jeff,

Yes, it was fixed fairly late in the release, post-beta. However if you've done the online updates you should have all the necessary bits and pieces. I'd appreciate it if you could re-try the safe video option, just to verify that issue is gone.

However, the -ati issues are the ones I'm more interested in getting squared away.

I've gone ahead and forwarded this issue upstream to https://bugs.freedesktop.org/show_bug.cgi?id=18871. Please subscribe to that bug report, in case upstream has additional questions or wishes you to test something. Thanks ahead of time!

Meanwhile, I've written a short troubleshooting guide for 'no signal' bugs, and it would be helpful if you could run through the steps, to see if you can get a functioning system at a lower resolution or something with glx enabled. If so, then that hints that there is an improper modeline being setup for your monitor.

Also, have you ever tested this with a different monitor? It would be a useful data point if the issue was seen to be specific to that monitor, vs. if it occurs with other monitors.

Changed in xserver-xorg-video-ati:
status: Confirmed → Triaged
Changed in xserver-xorg-driver-ati:
status: Unknown → Confirmed
Revision history for this message
Jeff Trull (jetrull) wrote :

Thanks for the helpful suggestions and for forwarding my bug upstream. I'll be happy to participate in debugging as much as I can :) I don't have easy access to a second monitor, but can try to borrow one from work. I will also try installing from the released Intrepid CD.

I wasn't able to locate your troubleshooting guide - can you attach a link?

Thanks,
Jeff

Revision history for this message
Jeff Trull (jetrull) wrote :

I can now confirm that "safe graphics" mode is working on the desktop release CD.

Revision history for this message
Steve Beattie (sbeattie) wrote :

This bug was reported in the Intrepid development cycle; removing regression-potential and marking as regression-release.

Revision history for this message
Jeff Trull (jetrull) wrote :

I finally got my hands on another monitor per Bryce's suggestion. The behavior is the same with one exception: the periodic flicker goes away. With the new monitor my observation is simply that enabling "glx" causes a blank screen and "RADEON(0): Idle timed out" messages in the log, while disabling it causes everything to work fine. As before, AccelMode and AGPMode have no effect. Could there be a glx issue with the RV100 hardware?

Revision history for this message
Jeff Trull (jetrull) wrote :

I've found another person with my same problem:

http://ubuntuforums.org/showthread.php?t=939603

I messaged him privately to see what happens if he disables "glx" and he confirms that it makes things work for him again.

So I think this is a general issue...

Revision history for this message
In , Jeff Trull (jetrull) wrote :

I was able to test on a different monitor and the flicker-when-glx-disabled disappeared, but the black screen with glx enabled persists. In addition, I found another user with the same problem:

http://ubuntuforums.org/showthread.php?t=939603

He also reports black screen and "Idle timed out, resetting engine..." messages. I emailed him privately and he confirmed that disabling glx cured his problem. So I think there is a general issue with glx and Radeon 7000/RV100 cards and this driver.

Thanks,
Jeff
(I am the original Ubuntu reporter)

Revision history for this message
In , Michel Dänzer (michel-daenzer) wrote :

I assume what really makes the difference is disabling the DRI, not GLX? Did you try Option "DRI" "off" to disable the DRI? (The "dri" module will be autoloaded by current X servers)

If that's the case, does the DRI work with Option "BusType" "PCI"?

Revision history for this message
In , Jeff Trull (jetrull) wrote :

Disabling either "glx" or "dri" in the Modules section allows me to use the driver successfully. Leaving both enabled and using BusType PCI per your suggestion also appears to work (and gives me those nice effects I've heard so much about, for the first time). Do these symptoms suggest anything? Do I have a setting wrong somewhere? Thanks.

Revision history for this message
In , agd5f (agd5f) wrote :

(In reply to comment #4)
> Disabling either "glx" or "dri" in the Modules section allows me to use the
> driver successfully. Leaving both enabled and using BusType PCI per your
> suggestion also appears to work (and gives me those nice effects I've heard so
> much about, for the first time). Do these symptoms suggest anything? Do I
> have a setting wrong somewhere? Thanks.
>

It means AGP isn't working on your system. Generally this means a bad AGP mode.

Can you try the AGPMode option again? e.g.,
Option "AGPMode" "1"

Please also try:
Option "AGPMode" "2"

Revision history for this message
In , Jeff Trull (jetrull) wrote :

OK, I've retried, and it's still the case that none of the "AGPMode" variants work. I also tried restoring the BIOS defaults, and then varying the AGP multipliers in both BIOS and xorg.conf, to no avail. I added changing "AccelMethod" "EXA" into the mix but that made no difference. Finally I noticed a comment somewhere on the web regarding aperture size. Although my BIOS aperture size setting was 64M, a kernel message (from agpgart-amd64) indicated it was 32M. Unfortunately changing the BIOS aperture size to 32M, and then subsequently to 128M, made no difference either.

Revision history for this message
Bryce Harrington (bryce) wrote : Re: intrepid alpha 6: no signal during install on AGP system

radeontool in Jaunty* can be used to assist in debugging this issue. After installing it, you run it like this:
    radeontool regmatch '*' > regdump_good.txt
    radeontool regmatch '*' > regdump_broke.txt

Run it two times. Once when you have a working screen (for any driver), and once in the broken case (either from the tty console or logged into the sick box remotely). Attach both of those to this bug report, and we can then forward this issue upstream. Thanks ahead of time!

For more info see https://wiki.ubuntu.com/X/Troubleshooting/BlankScreen

[* Note: radeontool is available in earlier versions of Ubuntu but is too old; if you need to run it on an older version of Ubuntu, you can obtain and build it from the upstream git tree at http://cgit.freedesktop.org/~airlied/radeontool/]

Revision history for this message
Jeff Trull (jetrull) wrote :

Bryce, thanks for your response. I built radeontool from source via git clone on my Intrepid system to avoid installation complexities. Results are attached.

The first file is the radeontool output with the vesa driver (clean visual, no glitching, no blank screen).

Revision history for this message
Jeff Trull (jetrull) wrote :

This second file is radeontool output with the ati driver but glx disabled (screen visible but glitches every 10 seconds).

Revision history for this message
Jeff Trull (jetrull) wrote :

This final radeontool output is with the ati driver and glx enabled (black screen). Thanks for your help.

Revision history for this message
In , Jeff Trull (jetrull) wrote :

Created an attachment (id=22862)
register settings from "radeontool" for the passing (no glx enabled) case

At Bryce's suggestion I built "radeontool" and am supplying the register settings found when glx is disabled (and the screen looks OK) and when it is enabled (blank screen, next attachment).

Revision history for this message
In , Jeff Trull (jetrull) wrote :

Created an attachment (id=22863)
radeontool register dump for glx enabled (black screen)

Revision history for this message
In , agd5f (agd5f) wrote :

It looks like AGP is just plain busted with your combination of card and motherboard. setting the bustype to pci is the right fix.

Revision history for this message
In , Bryce Harrington (bryce) wrote :

> It looks like AGP is just plain busted with your combination of card and
> motherboard. setting the bustype to pci is the right fix.

Is this something that could be detected and fixed/quirked in the driver itself? I'd like to avoid having to require users to set this in their xorg.conf since we're trying to move away from that in general, but if it's not possible I can just document it I guess.

Revision history for this message
In , agd5f (agd5f) wrote :

(In reply to comment #10)
> > It looks like AGP is just plain busted with your combination of card and
> > motherboard. setting the bustype to pci is the right fix.
>
> Is this something that could be detected and fixed/quirked in the driver
> itself? I'd like to avoid having to require users to set this in their
> xorg.conf since we're trying to move away from that in general, but if it's not
> possible I can just document it I guess.
>

You could add a pci mode quirk list just like you did for agp. maybe add it to RADEONPreInitChipType() in radeon_driver.c where we detect the bus type.

Revision history for this message
In , Jeff Trull (jetrull) wrote :

Created an attachment (id=22871)
log from X running with AMD64 kernel

I have some good news - this may be a kernel bug. I found this:

https://bugs.launchpad.net/ubuntu/+source/linux-source-2.6.15/+bug/29813

And followed its advice to install an AMD64 kernel. Aperture references in the kernel log changed from this (i686):

[ 18.447151] agpgart-amd64 0000:00:00.0: AGP aperture is 32M @ 0xe8000000

to this (x86_64):

[ 0.004000] Checking aperture...
[ 0.004000] Aperture from AGP @ e8000000 old size 32 MB
[ 0.004000] Aperture from AGP @ e8000000 size 128 MB (APSIZE f20)
[ 0.004000] Node 0: aperture @ 8e8000000 size 32 MB
[ 0.004000] Aperture beyond 4GB. Ignoring.
[ 0.575494] agpgart-amd64 0000:00:00.0: AGP aperture is 128M @ 0xe8000000

and I got video with glx!

working Xorg.log attached.

Revision history for this message
In , Bryce Harrington (bryce) wrote :

Well, switching to 64 bits seems like more of a workaround than a solution, so perhaps the pci quirk stuff is still necessary. I'll put that on my todo list, but I'm fairly swamped (as I'm sure everyone is) so can't say when I'll be able to get to it; if anyone else wants to give it a shot, please don't wait on me.

Revision history for this message
In , Jeff Trull (jetrull) wrote :

I think I found the kernel bug - aperture size is calculated by different code if in 64b mode (arch/x86/kernel/aperture_64.c) versus 32b mode (drivers/char/agp/amd64-agp.c). There's a section of very similar (possibly cut+paste then modified) code in the two files but in the 32b version an early exit is taken. I removed this early exit so the aperture size would be recalculated and got the correct value. Now running glx in 32b mode with my hacked kernel:

OpenGL renderer string: Mesa DRI Radeon 20061018 AGP 4x x86/MMX+/3DNow!+/SSE2 NO-TCL

I'm off to make a kernel bug report. Thanks for your help.

Revision history for this message
In , Bryce Harrington (bryce) wrote :

Great, good sleuthing tracking that down. Since it sounds like there is not an X problem here, I guess we can close these bugs. But feel free to reopen if there is additional X work to be done.

Revision history for this message
Bryce Harrington (bryce) wrote :

As per the upstream bug comments this seems to be a kernel issue rather than X, so closing.

Changed in xserver-xorg-video-ati:
status: Triaged → Invalid
Changed in xserver-xorg-driver-ati:
status: Confirmed → Invalid
Changed in xserver-xorg-driver-ati:
importance: Unknown → Medium
status: Invalid → Won't Fix
Changed in xserver-xorg-driver-ati:
importance: Medium → Unknown
Changed in xserver-xorg-driver-ati:
importance: Unknown → Medium
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.